LLVM/project 4150a78llvm/include/llvm/Transforms/Vectorize/SandboxVectorizer/Passes VecPassBase.h, llvm/lib/Transforms/Vectorize/SandboxVectorizer/Passes BottomUpVec.cpp TopDownVec.cpp

[SandboxVectorizer] Implement topdown vectorizer

This patch introduces the `top-down-vec` pass to the Sandbox Vectorizer,
adding the ability to traverse use-def chains top-down to discover and
collect vectorization opportunities.

Key changes include:
* TopDownVec Pass: Implemented `TopDownVec` which recursively processes
value bundles top-down, creates vectorization actions (widening, packing,
shuffles), and emits the final vector IR.
* Shared Infrastructure (VecPassBase): Extracted common IR emission logic
out of `BottomUpVec` and into a new shared base class, `VecPassBase`.
Functions for generating vector instructions, handling diamond reuse,
creating shuffles/packs, and collecting dead instructions are now shared
between the bottom-up and top-down vectorizers to prevent code
duplication.
* Pass Registration: Exposed `top-down-vec` in `PassRegistry.def` and
`SandboxVectorizerPassBuilder`, allowing it to be invoked within pass
pipelines via `opt`.

    [3 lines not shown]
DeltaFile
+2-281llvm/lib/Transforms/Vectorize/SandboxVectorizer/Passes/BottomUpVec.cpp
+262-0llvm/lib/Transforms/Vectorize/SandboxVectorizer/Passes/TopDownVec.cpp
+257-0llvm/lib/Transforms/Vectorize/SandboxVectorizer/Passes/VecPassBase.cpp
+121-0llvm/include/llvm/Transforms/Vectorize/SandboxVectorizer/Passes/VecPassBase.h
+94-0llvm/test/Transforms/SandboxVectorizer/topdown_vec.ll
+68-9llvm/test/Transforms/SandboxVectorizer/pack.ll
+804-2907 files not shown
+923-34613 files

LLVM/project b0d6d79llvm/lib/Target/AMDGPU AMDGPUISelDAGToDAG.cpp, llvm/test/CodeGen/AMDGPU packed-fp64.ll

[AMDGPU] Select fneg modifier for v2f64 instructions
DeltaFile
+28-17llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
+12-23llvm/test/CodeGen/AMDGPU/packed-fp64.ll
+40-402 files

NetBSD/pkgsrc-wip 2cf9fac. Makefile, py-toposort Makefile PLIST

py-toposort: import 1.10
DeltaFile
+18-0py-toposort/Makefile
+10-0py-toposort/PLIST
+5-0py-toposort/distinfo
+4-0py-toposort/DESCR
+1-1py-xsdata/Makefile
+1-0Makefile
+39-16 files

LLVM/project c91c9fcllvm/include/llvm/Support Allocator.h, llvm/unittests/Support AllocatorTest.cpp

Reland [Allocator] Keep bump pointer at a minimum alignment (#205240)

Reland #203718 (reverted in #205091) by making computation in integer
domain to avoid UB (nullptr + non-zero offset).

Add a `MinAlign` template parameter (default 8, sizeof(size_t) on 64-bit
platforms) so that the common case `Alignment <= MinAlign` can skip
realigning `CurPtr`.

This is achieved by rounding each allocation's size up to MinAlign, so
the bump pointer stays MinAlign-aligned between allocations.

SpecificBumpPtrAllocator::DestroyAll() walks objects at a fixed
sizeof(T) stride and needs tight packing, so it uses MinAlign=1.
(alignof(T) would
pack just as tightly and reuse the default instantiation, but T may be
incomplete here, e.g. `SpecificBumpPtrAllocator<MCSectionELF>`.)

Its `Allocate` still skips the realign: the slab is max_align_t-aligned

    [9 lines not shown]
DeltaFile
+41-17llvm/include/llvm/Support/Allocator.h
+19-0llvm/unittests/Support/AllocatorTest.cpp
+60-172 files

LLVM/project e2765f3llvm/lib/Transforms/IPO OpenMPOpt.cpp, llvm/test/Transforms/Attributor/reduced openmp_opt_constant_type_crash.ll

[OpenMPOpt][Attributor] Selectively seed deglobalization AAs (#198710)

This addresses a compile-time issue observed on a large generated C++
translation unit compiled with `-fopenmp`.

The source code is not OpenMP-heavy. It mainly consists of generated
function-registration wrappers, template instantiations, lambdas, and
small helper functions. However, because the TU is compiled with OpenMP
enabled, `OpenMPOptCGSCCPass` runs and drives Attributor on a module
with many functions.

`OpenMPOpt::registerAAsForFunction` currently eagerly creates the
deglobalization AAs for every function in OpenMP device modules:

* `AAHeapToShared`
* `AAHeapToStack`

Most generated wrapper/helper functions in the motivating workload do
not contain `__kmpc_alloc_shared`, removable allocations, or free-like

    [25 lines not shown]
DeltaFile
+34-13llvm/test/Transforms/Attributor/reduced/openmp_opt_constant_type_crash.ll
+34-10llvm/lib/Transforms/IPO/OpenMPOpt.cpp
+3-3llvm/test/Transforms/OpenMP/single_threaded_execution.ll
+71-263 files

FreeBSD/ports 282457bdatabases/heidisql distinfo Makefile

databases/heidisql: Update to 12.20

ChangeLog at:   https://github.com/HeidiSQL/HeidiSQL/releases
DeltaFile
+3-3databases/heidisql/distinfo
+1-2databases/heidisql/Makefile
+1-0databases/heidisql/pkg-plist
+5-53 files

FreeBSD/ports 811e2d7sysutils/bareos-server distinfo Makefile, sysutils/bareos-traymonitor Makefile

sysutils/bareos-*: update to 25.0.3

ChangeLog at:   https://github.com/bareos/bareos/releases/tag/Release%2F25.0.3
DeltaFile
+3-3www/bareos-webui/distinfo
+3-3sysutils/py-python-bareos/distinfo
+3-3sysutils/bareos-server/distinfo
+2-2sysutils/bareos-server/Makefile
+1-1sysutils/py-python-bareos/Makefile
+1-1sysutils/bareos-traymonitor/Makefile
+13-131 files not shown
+14-147 files

FreeBSD/ports 8e2a152sysutils/bareos24-client Makefile, sysutils/bareos24-server distinfo Makefile

sysutils/bareos24*: Update to 24.0.10

ChangeLog at:   https://github.com/bareos/bareos/releases/tag/Release%2F24.0.10
DeltaFile
+3-3www/bareos24-webui/distinfo
+3-3sysutils/bareos24-server/distinfo
+2-2sysutils/bareos24-server/Makefile
+0-2sysutils/bareos24-client/Makefile
+1-1sysutils/bareos24-traymonitor/Makefile
+1-1www/bareos24-webui/Makefile
+10-126 files

LLVM/project 77879b4llvm/lib/Target/AMDGPU AMDGPUISelLowering.cpp SIISelLowering.cpp, llvm/test/CodeGen/AMDGPU s-barrier-signal-var-gep.ll

[AMDGPU] Fold constant offsets into named barrier addresses

Allow isOffsetFoldingLegal to fold a constant offset into an LDS
named-barrier global, and include the node offset when materializing the
LDS address in LowerGlobalAddress. s_barrier_signal_var on a GEP'd named
barrier now selects the immediate form, matching a bare global and GlobalISel.
With object linking the offset folds into the relocation addend.

Change-Id: I639bc723eb001573585cc05d0ad19f2773054f21
Assisted-by: Cursor
DeltaFile
+11-5llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
+12-1llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2-5llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep.ll
+25-113 files

LLVM/project 261d748llvm/test/CodeGen/AMDGPU s-barrier-signal-var-gep.ll

[AMDGPU] Pre-commit test for constant-offset named barrier signal_var

A GEP into a named-barrier array (&bars[1]) lowers s_barrier_signal_var to
the dynamic m0 form on SelectionDAG, unlike the bare global and GlobalISel.
With object linking it emits a runtime add of the offset instead of folding
it into the relocation addend.

Change-Id: I7cea0dd64d050eb3e2143841e7136355cbb3bc50
Assisted-by: Cursor
DeltaFile
+119-0llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep.ll
+119-01 files

FreeBSD/ports 23ff2bcsysutils/bareos23-server distinfo Makefile, sysutils/bareos23-traymonitor Makefile

sysutils/bareos23*: Update to 23.1.7

ChangeLog at:   https://github.com/bareos/bareos/releases/tag/Release%2F23.1.7
DeltaFile
+3-3sysutils/bareos23-server/distinfo
+3-3www/bareos23-webui/distinfo
+1-1sysutils/bareos23-server/Makefile
+1-1sysutils/bareos23-traymonitor/Makefile
+1-1www/bareos23-webui/Makefile
+9-95 files

LLVM/project 86184abllvm/lib/Target/AMDGPU AMDGPUISelLowering.cpp SIISelLowering.cpp, llvm/test/CodeGen/AMDGPU s-barrier-signal-var-gep-object-linking.ll s-barrier-signal-var-gep.ll

[AMDGPU] Fold constant offsets into named barrier addresses

Allow isOffsetFoldingLegal to fold a constant offset into an LDS
named-barrier global, and include the node offset when materializing the
LDS address in LowerGlobalAddress. s_barrier_signal_var on a GEP'd named
barrier now selects the immediate form, matching a bare global and GlobalISel.
With object linking the offset folds into the relocation addend.

Change-Id: Ie05b8c8cd127604ff174c423a74340fd2de4e405
Assisted-by: Cursor
DeltaFile
+11-5llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
+12-1llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2-2llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep-object-linking.ll
+1-2llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep.ll
+26-104 files

LLVM/project b820eb7llvm/test/CodeGen/AMDGPU s-barrier-signal-var-gep.ll s-barrier-signal-var-gep-object-linking.ll

[AMDGPU] Pre-commit test for constant-offset named barrier signal_var

A GEP into a named-barrier array (&bars[1]) lowers s_barrier_signal_var to
the dynamic m0 form on SelectionDAG, unlike the bare global and GlobalISel.
With object linking it emits a runtime add of the offset instead of folding
it into the relocation addend.

Change-Id: I59f0e6fe6a72b4c96c8efb926610f7f2d3833e38
Assisted-by: Cursor
DeltaFile
+59-0llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep.ll
+40-0llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep-object-linking.ll
+99-02 files

LLVM/project d853c05clang/include/clang/CIR/Dialect/Builder CIRBaseBuilder.h, clang/lib/CIR/CodeGen CIRGenBuiltin.cpp CIRGenExpr.cpp

[CIR] Add support for __builtin_nontemporal_store and __builtin_nontemporal_load (#197872)

Add nontemporal attribute to cir.load and cir.store ops.
DeltaFile
+77-0clang/test/CIR/CodeGenBuiltins/builtin-nontemporal.cpp
+12-8clang/lib/CIR/Dialect/Transforms/FlattenCFG.cpp
+18-2clang/lib/CIR/CodeGen/CIRGenBuiltin.cpp
+8-11clang/lib/CIR/CodeGen/CIRGenExpr.cpp
+9-7clang/include/clang/CIR/Dialect/Builder/CIRBaseBuilder.h
+7-5clang/lib/CIR/CodeGen/CIRGenBuilder.h
+131-3310 files not shown
+161-4716 files

LLVM/project 1cbfe8bllvm/include/llvm/IR GlobalValue.h, llvm/include/llvm/Transforms/Utils AssignGUID.h

Reland #184065
DeltaFile
+61-17llvm/lib/Bitcode/Reader/BitcodeReader.cpp
+45-30llvm/lib/LTO/LTO.cpp
+64-2llvm/lib/IR/Globals.cpp
+49-3llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
+45-5llvm/include/llvm/IR/GlobalValue.h
+49-0llvm/include/llvm/Transforms/Utils/AssignGUID.h
+313-57120 files not shown
+872-416126 files

LLVM/project 9b228b5llvm/lib/Transforms/IPO ThinLTOBitcodeWriter.cpp WholeProgramDevirt.cpp, llvm/test/ThinLTO/X86 devirt_function_alias2.ll

[CFI] Create an external linkage alias instead of promoting internals
DeltaFile
+20-33llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
+20-5llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
+10-7llvm/test/Transforms/ThinLTOBitcodeWriter/comdat.ll
+16-0llvm/lib/Transforms/IPO/LowerTypeTests.cpp
+6-4llvm/test/ThinLTO/X86/devirt_function_alias2.ll
+4-2llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll
+76-513 files not shown
+83-569 files

FreeBSD/ports 616fe5bx11-drivers/xlibre-xf86-input-joystick Makefile, x11-drivers/xlibre-xf86-input-keyboard Makefile

x11-server/xlibre-server: update to 25.1.8

- Bump PORTREVISION of xlibre-xf86-* ports

ChangeLog at:   https://github.com/X11Libre/xserver/releases/tag/xlibre-xserver-25.1.8
With hat:       xlibre
DeltaFile
+3-3x11-servers/xlibre-server/distinfo
+1-1x11-drivers/xlibre-xf86-input-joystick/Makefile
+1-1x11-drivers/xlibre-xf86-input-keyboard/Makefile
+1-1x11-drivers/xlibre-xf86-input-libinput/Makefile
+1-1x11-drivers/xlibre-xf86-input-mouse/Makefile
+1-1x11-drivers/xlibre-xf86-input-synaptics/Makefile
+8-818 files not shown
+26-2524 files

LLVM/project f519bd9llvm/lib/IR Verifier.cpp, llvm/test/Verifier memprof-metadata-bad.ll

[Verifier] Require !callsite with !memprof metadata (#205053)

Fixes: https://github.com/llvm/llvm-project/issues/181237
DeltaFile
+10-6llvm/test/Verifier/memprof-metadata-bad.ll
+3-0llvm/lib/IR/Verifier.cpp
+13-62 files

Linux/linux 502d801Documentation/filesystems erofs.rst, fs/erofs fscache.c data.c

Merge tag 'erofs-for-7.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/xiang/erofs

Pull erofs updates from Gao Xiang:
 "The most notable change is the removal of the fscache backend: it has
  been deprecated for almost two years, mainly because EROFS file-backed
  mounts and fanotify pre-content hooks (together with erofs-utils) now
  provide better functionality and simpler codebase. In addition,
  fscache has depended on netfslib for years, which is undesirable for
  EROFS since it is a local filesystem. More details in [1].

  In addition, sparse support has been added to the pcluster layout,
  which is helpful for large sparse AI datasets, and map requests for
  chunk-based inodes have been optimized to be more efficient as well.
  There are also the usual fixes and cleanups.

  Summary:

   - Report more consecutive chunks of the same type for
     each iomap request

    [21 lines not shown]
DeltaFile
+0-664fs/erofs/fscache.c
+72-63fs/erofs/data.c
+65-65Documentation/filesystems/erofs.rst
+20-78fs/erofs/super.c
+3-69fs/erofs/internal.h
+20-27fs/erofs/ishare.c
+180-9667 files not shown
+227-1,03013 files

OpenBSD/src xq6WM2Csys/dev/pci/drm/i915 i915_scatterlist.c

   don't increment scatterlist length twice

   this occurs as sg_dma_len() returns the length member of struct scatterlist
   where as on x86 linux it returns a dma_length member of the struct

   Problem reported by Ryan Fahy in FreeBSD drm-kmod PR 468.

   Avoids a 'Data modified on freelist' panic on boot when using discrete
   Intel cards (DG2).  DG2 has other issues, so remains disabled for now.
VersionDeltaFile
1.6+4-0sys/dev/pci/drm/i915/i915_scatterlist.c
+4-01 files

LLVM/project 8995486llvm/include/llvm/IR IntrinsicsRISCV.td, llvm/lib/Target/RISCV RISCVISelLowering.cpp RISCVInstrInfoP.td

[RISCV][P-ext] packed exchanged add/sub codegen (#203473)

Wire up the already-defined exchanged add/sub instructions
pas/psa/psas/pssa/paas/pasa with llvm.riscv.* intrinsics and isel
patterns.
DeltaFile
+174-0llvm/test/CodeGen/RISCV/rvp-simd-64.ll
+68-2llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+54-0llvm/test/CodeGen/RISCV/rvp-simd-32.ll
+24-0llvm/lib/Target/RISCV/RISCVInstrInfoP.td
+8-0llvm/include/llvm/IR/IntrinsicsRISCV.td
+328-25 files

LLVM/project 677d378llvm/lib/Transforms/IPO ThinLTOBitcodeWriter.cpp WholeProgramDevirt.cpp, llvm/test/ThinLTO/X86 devirt_function_alias2.ll

[CFI] Create an external linkage alias instead of promoting internals
DeltaFile
+20-33llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
+20-5llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
+10-7llvm/test/Transforms/ThinLTOBitcodeWriter/comdat.ll
+16-0llvm/lib/Transforms/IPO/LowerTypeTests.cpp
+6-4llvm/test/ThinLTO/X86/devirt_function_alias2.ll
+4-2llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll
+76-513 files not shown
+83-569 files

LLVM/project 4bf5379llvm/include/llvm/IR GlobalValue.h, llvm/include/llvm/Transforms/Utils AssignGUID.h

Reland #184065
DeltaFile
+61-17llvm/lib/Bitcode/Reader/BitcodeReader.cpp
+45-30llvm/lib/LTO/LTO.cpp
+64-2llvm/lib/IR/Globals.cpp
+49-3llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
+45-5llvm/include/llvm/IR/GlobalValue.h
+49-0llvm/include/llvm/Transforms/Utils/AssignGUID.h
+313-57119 files not shown
+858-410125 files

LLVM/project b0bd945compiler-rt/lib/instrumentor-examples/precision-analysis precision_analysis_runtime.cpp CMakeLists.txt, compiler-rt/test/instrumentor-examples precision_fp16_overflow.c precision_detailed.c

[Instrumentor] Add runtime examples: [2/N] A FP precision analysis

Second example:
Check all floating point operations and track if they could be done at
lower precision.

Partially developped by Claude (AI), tested and verified by me.
DeltaFile
+616-0compiler-rt/lib/instrumentor-examples/precision-analysis/precision_analysis_runtime.cpp
+92-0compiler-rt/test/instrumentor-examples/precision_fp16_overflow.c
+76-0compiler-rt/test/instrumentor-examples/precision_detailed.c
+68-0compiler-rt/lib/instrumentor-examples/precision-analysis/CMakeLists.txt
+66-0compiler-rt/test/instrumentor-examples/precision_mixed.c
+56-0compiler-rt/test/instrumentor-examples/simple_precision.c
+974-05 files not shown
+1,051-011 files

FreeBSD/src cf523f4sys/fs/nfs nfsclstate.h nfs_var.h, sys/fs/nfsclient nfs_clrpcops.c nfs_clstate.c

nfscl: Add support for flexible file layout striping

Commit 72e57bc26417 added support for striping to the pNFS
server configuration.  This patch adds support for striping
to the NFS client.

For striped flexible file layouts, an extra structure
must be malloc()d for each stripe, since the number
of stripe servers can vary from one mirror to another.
This new structure is called nfsffs and a single one
of these structures is in the nfsffm structure so that
the non-striped layouts can avoid the additional malloc()'s.

This patch only affects NFSv4.1/4.2 mounts that use the
"pnfs" mount option against servers that support the
flexible file layout.
DeltaFile
+202-105sys/fs/nfsclient/nfs_clrpcops.c
+24-12sys/fs/nfsclient/nfs_clstate.c
+8-2sys/fs/nfs/nfsclstate.h
+1-1sys/fs/nfs/nfs_var.h
+235-1204 files

LLVM/project dfe8b22llvm/lib/Target/BPF BPFSelectionDAGInfo.cpp

[BPF] Increase BPFMaxStoresPerMemFunc from 128 to 192 (#205222)

With commits [1] and [2], memory operations like memcpy/memmove lower to
a sequence of loads/stores whose width is the minimum of the source and
destination alignment, and the store count is bounded by
BPFMaxStoresPerMemFunc. For 1-byte alignment, the maximum copy length
that can be inlined is therefore 128 bytes.

This may regress cases that previously inlined. Consider a memcpy with
src alignment 8, dst alignment 1 and size 136. After [1]/[2], the store
width is the minimum alignment (1 byte), so the store count is 136,
which exceeds the 128 limit and the copy falls back. Before [1]/[2], the
store count was computed with a fixed 8-byte unit regardless of the
actual alignment (each unit expands to 8 one-byte stores when the
minimum alignment is 1), so the total count was only 17 (136/8 < 128)
and the copy was inlined.

Raise the limit from 128 to 192 to mitigate. Alternatively, users can
increase alignment to avoid the regression.

    [2 lines not shown]
DeltaFile
+1-1llvm/lib/Target/BPF/BPFSelectionDAGInfo.cpp
+1-11 files

LLVM/project 0dddce7compiler-rt/lib/instrumentor-examples/precision-analysis precision_analysis_runtime.cpp CMakeLists.txt, compiler-rt/test/instrumentor-examples precision_fp16_overflow.c precision_detailed.c

[Instrumentor] Add runtime examples: [2/N] A FP precision analysis

Second example:
Check all floating point operations and track if they could be done at
lower precision.

Partially developped by Claude (AI), tested and verified by me.
DeltaFile
+569-0compiler-rt/lib/instrumentor-examples/precision-analysis/precision_analysis_runtime.cpp
+91-0compiler-rt/test/instrumentor-examples/precision_fp16_overflow.c
+76-0compiler-rt/test/instrumentor-examples/precision_detailed.c
+68-0compiler-rt/lib/instrumentor-examples/precision-analysis/CMakeLists.txt
+66-0compiler-rt/test/instrumentor-examples/precision_mixed.c
+56-0compiler-rt/test/instrumentor-examples/simple_precision.c
+926-05 files not shown
+1,003-011 files

LLVM/project 1ad5dfdcompiler-rt/lib/instrumentor-examples instrumentor_runtime.h README.md, compiler-rt/lib/instrumentor-examples/flop-counter flop_counter_runtime.cpp README.md

[Instrumentor] Add runtime examples: [1/N] A flop counter

This adds a instrumentor-examples folder into compiler RT to showcase
use cases of the instrumentor. The initial example is a program that,
via instrumentation, counts the number of flops performed.

Partially developped by Claude (AI), tested and verified by me.
DeltaFile
+295-0compiler-rt/lib/instrumentor-examples/instrumentor_runtime.h
+180-0compiler-rt/lib/instrumentor-examples/flop-counter/flop_counter_runtime.cpp
+107-0compiler-rt/lib/instrumentor-examples/flop-counter/README.md
+83-4llvm/lib/Transforms/IPO/Instrumentor.cpp
+74-0compiler-rt/test/instrumentor-examples/lit.cfg.py
+72-0compiler-rt/lib/instrumentor-examples/README.md
+811-412 files not shown
+1,123-518 files

LLVM/project 617fad6llvm/test/CodeGen/AMDGPU s-barrier-signal-var-gep.ll

Apply suggestion from @chinmaydd
DeltaFile
+0-3llvm/test/CodeGen/AMDGPU/s-barrier-signal-var-gep.ll
+0-31 files

FreeBSD/ports 1edbeefeditors/lazarus Makefile

editors/lazarus[qt5|qt6]: Fix build on aarch64
DeltaFile
+3-0editors/lazarus/Makefile
+3-01 files