LLVM/project 7966be4llvm/lib/Transforms/IPO SampleProfileMatcher.cpp, llvm/test/Transforms/SampleProfile pseudo-probe-stale-profile-renaming.ll

Add a flag to disable salvage-unused-profile for large modules.
DeltaFile
+11-0llvm/lib/Transforms/IPO/SampleProfileMatcher.cpp
+4-0llvm/test/Transforms/SampleProfile/pseudo-probe-stale-profile-renaming.ll
+15-02 files

LLVM/project 5b60283llvm/lib/Target/RISCV RISCVTargetTransformInfo.cpp, llvm/test/Analysis/CostModel/RISCV extract-last-active.ll

[RISCV][TTI] Implement cost of llvm.experimental.vector.extract.last.active (#184067)

This patch implements the cost of
llvm.experimental.vector.extract.last.active which will lower to:
 vcpop.m a0, v0
 beqz a0, exit # Return passthru when the entire lane is inactive.
vid v10, v0.t
vredmaxu.vs v10, v10, v10
vmv.v.s a0, v10
zext.b a0, a0
vslidedown v8, v8,

This patch also helps conditional-scalar-assignment (CSA) works for
scalable vector.
DeltaFile
+108-0llvm/test/Analysis/CostModel/RISCV/extract-last-active.ll
+45-0llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp
+153-02 files

LLVM/project 1889beallvm/lib/Target/AMDGPU CMakeLists.txt SIRegisterInfo.cpp

Exclude AGPRs from the alignment check.

AGPR spills are lowered to 4-byte access by default,
as we need to scavenge a temporary vgpr for them.
DeltaFile
+7-0llvm/lib/Target/AMDGPU/CMakeLists.txt
+1-1llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
+8-12 files

LLVM/project e663114llvm/lib/Transforms/IPO SampleProfileMatcher.cpp, llvm/test/Transforms/SampleProfile pseudo-probe-stale-profile-renaming.ll

Add a flag to disable salvage-unused-profile for large modules.
DeltaFile
+11-0llvm/lib/Transforms/IPO/SampleProfileMatcher.cpp
+4-0llvm/test/Transforms/SampleProfile/pseudo-probe-stale-profile-renaming.ll
+15-02 files

LLVM/project 04dbbd4llvm/lib/Debuginfod HTTPServer.cpp HTTPClient.cpp, llvm/lib/Support/HTTP HTTPServer.cpp HTTPClient.cpp

[Support] Move HTTP client/server to new LLVMSupportHTTP lib (NFC) (#184572)

Relocate HTTPClient and HTTPServer from the Debuginfod library to
llvm/Support/HTTP so they can be reused by other components.

---------

Co-authored-by: Alexandre Ganea <aganea at havenstudios.com>
Co-authored-by: Jonas Devlieghere <jonas at devlieghere.com>
DeltaFile
+310-0llvm/unittests/Support/HTTP/HTTPServerTests.cpp
+0-310llvm/unittests/Debuginfod/HTTPServerTests.cpp
+0-198llvm/lib/Debuginfod/HTTPServer.cpp
+195-0llvm/lib/Support/HTTP/HTTPServer.cpp
+164-0llvm/lib/Support/HTTP/HTTPClient.cpp
+0-162llvm/lib/Debuginfod/HTTPClient.cpp
+669-67020 files not shown
+933-91626 files

LLVM/project 8b0a9d0llvm/lib/Target/X86 X86InstrAVX10.td, llvm/test/MC/X86/Relocations avx10.2satcvt.s

[X86] Fix wrong RIP-relative relocations for AVX10.2 saturation conversions (#185254)

AVX10.2 saturation conversion instructions (VCVT{T}{BF16,PH,PS}2I{U}BS)
incorrectly inherited from AVX512{PS,XD,PD}Ii8Base, which sets
ImmT=Imm8. This caused X86MCCodeEmitter to account for a nonexistent
trailing immediate byte when computing RIP-relative displacement fixups,
producing an addend of -5 instead of the correct -4.

Replace AVX512PSIi8Base/AVX512XDIi8Base/AVX512PDIi8Base with just the
needed prefix classes (PS is default, XD, PD), dropping the bogus
ImmT=Imm8. Preserve the original ExeDomain values (SSEPackedSingle,
SSEPackedDouble, SSEPackedInt; verified with `llvm-tblgen
--print-records ./llvm/lib/Target/X86/X86.td -I./llvm/include/
-I./llvm/lib/Target/X86/`). ExeDomain is not covered by any test.

Fix https://github.com/llvm/llvm-project/issues/184251
DeltaFile
+40-0llvm/test/MC/X86/Relocations/avx10.2satcvt.s
+24-12llvm/lib/Target/X86/X86InstrAVX10.td
+64-122 files

LLVM/project faa1cccclang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/non-overloaded vdota4su_vv.c vdota4_vv.c, clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/overloaded vdota4su_vv.c vdota4_vv.c

[RISCV][llvm] Use [u]int32 vector for input arguments for zvdot4a8i (#184089)

intrinsic spec PR:
https://github.com/riscv-non-isa/rvv-intrinsic-doc/pull/422
DeltaFile
+75-75clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/non-overloaded/vdota4su_vv.c
+75-75clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/overloaded/vdota4su_vv.c
+73-73clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/non-overloaded/vdota4_vv.c
+73-73clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/overloaded/vdota4_vv.c
+67-67clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/non-overloaded/vdota4u_vv.c
+67-67clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvdot4a8i/policy/overloaded/vdota4u_vv.c
+430-43029 files not shown
+1,573-1,58735 files

HardenedBSD/src 5cbf7b4sys/dev/cxgbe t4_main.c t4_sge.c

Merge remote-tracking branch 'origin/hardened/current/master' into hardened/current/cross-dso-cfi
DeltaFile
+8-6sys/dev/cxgbe/t4_main.c
+4-6sys/dev/cxgbe/t4_sge.c
+12-122 files

HardenedBSD/src 17bb91asys/dev/cxgbe t4_main.c t4_sge.c

Merge branch 'freebsd/current/main' into hardened/current/master
DeltaFile
+8-6sys/dev/cxgbe/t4_main.c
+4-6sys/dev/cxgbe/t4_sge.c
+12-122 files

LLVM/project df563b9llvm/lib/Transforms/InstCombine InstCombineSelect.cpp

Address comments.
DeltaFile
+14-52llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
+14-521 files

HardenedBSD/ports 37b6083devel/cargo-c distinfo Makefile.crates, sysutils/aoostar-rs distinfo Makefile

Merge branch 'freebsd/main' into hardenedbsd/main
DeltaFile
+273-239devel/cargo-c/distinfo
+501-0sysutils/aoostar-rs/distinfo
+284-0sysutils/aoostar-rs/Makefile
+135-118devel/cargo-c/Makefile.crates
+71-0sysutils/aoostar-rs/files/aoostar-lcd.in
+16-0sysutils/aoostar-rs/pkg-descr
+1,280-35710 files not shown
+1,296-38716 files

LLVM/project e20ae16libclc CMakeLists.txt, libclc/clc/lib/generic CMakeLists.txt SOURCES

[libclc] Use custom CMake handling to overhaul libclc compilation (#185247)

Summary:
This PR uses https://github.com/llvm/llvm-project/pull/185243 to
overhaul compilation of libclc. This brings libclc to the same kind of
compilation flow that the other GPU libraries use (compiler-rt, libc,
libc++, openmp, flang-rt).

The main brunt of this change is simply changing the SOURCES files to
CMake variables and altering the compilation. Now that these are
standard CMake libraries we do not need to bother redefining custom
library handling and targets.

This builds as a static library, which we then consume with `llvm-link`
which converts it into a single `.bc` bitcode file similarly to before.
The final result is then optimized all together.

Hopefully this doesn't break anything.
DeltaFile
+156-535libclc/cmake/modules/AddLibclc.cmake
+165-318libclc/CMakeLists.txt
+240-0libclc/opencl/lib/generic/CMakeLists.txt
+0-220libclc/opencl/lib/generic/SOURCES
+207-0libclc/clc/lib/generic/CMakeLists.txt
+0-187libclc/clc/lib/generic/SOURCES
+768-1,26016 files not shown
+1,020-1,46122 files

FreeBSD/src 8f72d93sys/dev/cxgbe t4_main.c t4_sge.c

cxgbe(4): minor changes in code dealing with ncores

1. ncores and devlog information is read as a combination so it makes
   sense to validate them in the same routine (and nowhere else).
2. ncores is never 0 and idx % ncores is always a valid coreid.

MFC after:      1 week
Sponsored by:   Chelsio Communications
DeltaFile
+8-6sys/dev/cxgbe/t4_main.c
+4-6sys/dev/cxgbe/t4_sge.c
+12-122 files

HardenedBSD/src 8f72d93sys/dev/cxgbe t4_main.c t4_sge.c

cxgbe(4): minor changes in code dealing with ncores

1. ncores and devlog information is read as a combination so it makes
   sense to validate them in the same routine (and nowhere else).
2. ncores is never 0 and idx % ncores is always a valid coreid.

MFC after:      1 week
Sponsored by:   Chelsio Communications
DeltaFile
+8-6sys/dev/cxgbe/t4_main.c
+4-6sys/dev/cxgbe/t4_sge.c
+12-122 files

LLVM/project 76cfb0bllvm/lib/Target/AMDGPU SIInsertWaitcnts.cpp

[NFC][AMDGPU] WaitEventSet should only print events that it contains
DeltaFile
+2-1llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
+2-11 files

LLVM/project d0ddae6flang/lib/Lower HlfirIntrinsics.cpp, flang/lib/Optimizer/Builder IntrinsicCall.cpp

[flang] Fix segfault in CSHIFT/EOSHIFT with dynamically optional DIM (#184431)

When `DIM` is passed as an optional dummy argument and is absent at
runtime, the HLFIR lowering for the `CSHIFT` and `EOSHIFT` intrinsics
treated it as unconditionally present. This resulted in an unconditional
load of the `DIM` reference, causing a null pointer dereference and a
runtime segmentation fault when absent.

The underlying issue was that the `dim` argument for `cshift` and
`eoshift` was not marked with `handleDynamicOptional` during intrinsic
argument lowering setup. As a result, the `isPresent` state was never
populated, and the lowering implementation incorrectly fell through to
an unconditional scalar load.

This patch resolves the issue by:
1. Updating the `dim` entries for `cshift` and `eoshift` in
`IntrinsicCall.cpp` to use `handleDynamicOptional`. This enables
`getOperandVector()` to appropriately emit a guarded load (via
`loadOptionalValue()`) that safely returns a 0 placeholder when `DIM` is

    [10 lines not shown]
DeltaFile
+50-0flang/test/Lower/HLFIR/cshift-optional-dim.f90
+18-2flang/lib/Lower/HlfirIntrinsics.cpp
+4-2flang/lib/Optimizer/Builder/IntrinsicCall.cpp
+72-43 files

LLVM/project 874ef00flang/lib/Lower ConvertConstant.cpp, flang/test/Lower pdt-struct-constructor-init.f90

[Flang] Fix crash in structure constructor lowering for PDT (#183543)

Fixes - [#181278](https://github.com/llvm/llvm-project/issues/181278)
This patch fixes a crash in Flang when parsing array constructors like:
 

`[ty0(2)(4)]`
 
The current implementation parses this as a type constructor ty0(2),
followed by what appears to be another call (4), instead of rejecting it
as invalid syntax. The lowering of` StructureConstructor` attempts to
retrieve the parent derived type using `sym->owner().derivedTypeSpec()`,
which return `nullptr` for PDT cases and lead to a crash.
 
In` flang/lib/Lower/ConvertConstant.cpp`, a safeguard is being added
which ensures that we fall back to the constructor’s derived type
specification when the parent type cannot be obtained, preventing the
null dereference and eliminating the crash. This change addresses only
the immediate crash, proper diagnostic handling for this invalid syntax

    [4 lines not shown]
DeltaFile
+18-0flang/test/Lower/pdt-struct-constructor-init.f90
+6-0flang/lib/Lower/ConvertConstant.cpp
+24-02 files

LLVM/project e30f9c1llvm/test/Transforms/LoopVectorize iv-select-cmp-decreasing.ll vector-loop-backedge-elimination.ll, llvm/test/Transforms/LoopVectorize/AArch64 optsize_minsize.ll

Revert "Reapply "[VPlan] Remove manual region removal when simplifying for VF and UF. (#181252)""

This reverts commit 6aa115bba55054b0dc81ebfc049e8c7a29e614b2.

This is causing crashes. See #185345 for details.
DeltaFile
+205-87llvm/test/Transforms/LoopVectorize/iv-select-cmp-decreasing.ll
+0-266llvm/test/Transforms/LoopVectorize/vector-loop-backedge-elimination.ll
+125-84llvm/test/Transforms/LoopVectorize/AArch64/optsize_minsize.ll
+9-187llvm/test/Transforms/LoopVectorize/epilog-vectorization-reductions.ll
+116-63llvm/test/Transforms/LoopVectorize/load-deref-pred-poison-ub-ops-feeding-pointer.ll
+78-54llvm/test/Transforms/LoopVectorize/X86/epilog-vectorization-ordered-reduction.ll
+533-74117 files not shown
+914-99623 files

LLVM/project 100602ellvm/lib/Target/AMDGPU SIRegisterInfo.cpp, llvm/test/CodeGen/AMDGPU vgpr-spill.mir

[AMDGPU] Multi dword spilling for unaligned tuples

While spilling unaligned tuples, rather than breaking the
spill into 32-bit accesses, spill the first register as a
single 32-bit spill, and spill the remainder of the tuple
as an aligned tuple.
Some additional bookkeeping is required in the spilling
loop to manage the state.
DeltaFile
+44-7llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
+8-26llvm/test/CodeGen/AMDGPU/vgpr-spill.mir
+52-332 files

FreeBSD/ports 2ca1210sysutils/py-dbuild distinfo Makefile, sysutils/py-dbuild/files patch-dbuild_ci__test.py

sysutils/py-dbuild: Update to 1.5.0

ChangeLog:
https://github.com/daemonless/dbuild/compare/v1.4.0...v1.5.0
DeltaFile
+0-10sysutils/py-dbuild/files/patch-dbuild_ci__test.py
+3-3sysutils/py-dbuild/distinfo
+1-1sysutils/py-dbuild/Makefile
+4-143 files

HardenedBSD/ports 2ca1210sysutils/py-dbuild distinfo Makefile, sysutils/py-dbuild/files patch-dbuild_ci__test.py

sysutils/py-dbuild: Update to 1.5.0

ChangeLog:
https://github.com/daemonless/dbuild/compare/v1.4.0...v1.5.0
DeltaFile
+0-10sysutils/py-dbuild/files/patch-dbuild_ci__test.py
+3-3sysutils/py-dbuild/distinfo
+1-1sysutils/py-dbuild/Makefile
+4-143 files

LLVM/project c6b378allvm/lib/Target/AMDGPU SIInstructions.td SIRegisterInfo.cpp, llvm/test/CodeGen/AMDGPU vgpr-spill.mir

[AMDGPU] Multi dword spilling for unaligned tuples

While spilling unaligned tuples, rather than breaking the
spill into 32-bit accesses, spill the first register as a
single 32-bit spill, and spill the remainder of the tuple
as an aligned tuple.
Some additional bookkeeping is required in the spilling
loop to manage the state.
DeltaFile
+281-38llvm/test/CodeGen/AMDGPU/vgpr-spill.mir
+39-39llvm/lib/Target/AMDGPU/SIInstructions.td
+65-5llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
+1-1llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
+386-834 files

HardenedBSD/ports a7f23fedevel/cargo-c distinfo Makefile.crates

devel/cargo-c: update to 0.10.21

Changes: https://github.com/lu-zero/cargo-c/releases/tag/v0.10.21
DeltaFile
+273-239devel/cargo-c/distinfo
+135-118devel/cargo-c/Makefile.crates
+2-3devel/cargo-c/Makefile
+410-3603 files

FreeBSD/ports a7f23fedevel/cargo-c distinfo Makefile.crates

devel/cargo-c: update to 0.10.21

Changes: https://github.com/lu-zero/cargo-c/releases/tag/v0.10.21
DeltaFile
+273-239devel/cargo-c/distinfo
+135-118devel/cargo-c/Makefile.crates
+2-3devel/cargo-c/Makefile
+410-3603 files

LLVM/project 1d6607dllvm/lib/Target/AMDGPU SIInstructions.td SIRegisterInfo.cpp, llvm/test/CodeGen/AMDGPU vgpr-spill.mir

[AMDGPU] Remove alignment constraint from spill pseudos (#177317)

Spill pseudo opcodes don't require target reg class alignment
constraint.
For targets which do have alignment constraints, lower the spills to
32-bit accesses.
Update the machine verifier accordingly.
Sgpr spill pseudos didn't enforce alignment constraints.
Modify vgpr spills reg class to not enforce them either.
DeltaFile
+299-38llvm/test/CodeGen/AMDGPU/vgpr-spill.mir
+39-39llvm/lib/Target/AMDGPU/SIInstructions.td
+27-3llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
+1-1llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
+366-814 files

LLVM/project f68af65mlir/lib/Dialect/X86/Transforms VectorContractToPackedTypeDotProduct.cpp, mlir/lib/Dialect/X86/Utils X86Utils.cpp

[mlir][x86] Extends vector.contract Flat dot-product lowering for arg offset (#185167)

Extends `vector_contract_to_packed_type_dot_product` transform pass to
include `args` offset check while validating the `vector.contract`.
Eg: `vector.transfer_read %arg1[%arg3, %c0], %0 {in_bounds = [true,
true]} : !memref, !vec`
DeltaFile
+70-0mlir/test/Dialect/X86/vector-contract-to-packed-type-dotproduct.mlir
+9-9mlir/lib/Dialect/X86/Transforms/VectorContractToPackedTypeDotProduct.cpp
+11-4mlir/lib/Dialect/X86/Utils/X86Utils.cpp
+90-133 files

LLVM/project 95a9647mlir/include/mlir/Dialect/LLVMIR NVVMOps.td, mlir/lib/Dialect/LLVMIR/IR NVVMDialect.cpp

[MLIR][NVVM] Add nvvm.fma Op (#184776)

Adds `nvvm.fma` Op to the NVVM dialect to perform fused multiply-add
operations.

PTX ISA Reference:
1.
https://docs.nvidia.com/cuda/parallel-thread-execution/#floating-point-instructions-fma
2.
https://docs.nvidia.com/cuda/parallel-thread-execution/#half-precision-floating-point-instructions-fma
DeltaFile
+294-0mlir/test/Target/LLVMIR/nvvm/fma/fma_vector.mlir
+146-25mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp
+114-0mlir/test/Target/LLVMIR/nvvm/fma/fma.mlir
+89-0mlir/test/Target/LLVMIR/nvvm/fma/fma_invalid.mlir
+47-0mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
+43-0mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
+733-256 files

NetBSD/pkgsrc bGP8rDVtextproc Makefile

   add and enable various modules needed to build docs for Cyrus IMAPd
VersionDeltaFile
1.1581+8-1textproc/Makefile
+8-11 files

NetBSD/pkgsrc Z8syLpodoc CHANGES-2026

   doc: Added textproc/py-sphinxcontrib-seqdiag version 3.0.0
VersionDeltaFile
1.1622+2-1doc/CHANGES-2026
+2-11 files

NetBSD/pkgsrc bbCTQE2textproc/py-sphinxcontrib-seqdiag Makefile PLIST

   textproc/py-sphinxcontrib-seqdiag: import py313-sphinxcontrib-seqdiag-3.0.0

   This extension enables you to insert sequence diagrams into your document.
VersionDeltaFile
1.1+21-0textproc/py-sphinxcontrib-seqdiag/Makefile
1.1+12-0textproc/py-sphinxcontrib-seqdiag/PLIST
1.1+5-0textproc/py-sphinxcontrib-seqdiag/distinfo
1.1+1-0textproc/py-sphinxcontrib-seqdiag/DESCR
+39-04 files