NetBSD/pkgsrc-wip b4cc592basu distinfo Makefile

Revert "basu: switch to browser download filename"

This reverts commit d443f9931389e43df4e796c62d42e16318d3d6f2.

sr.ht download works again
DeltaFile
+3-3basu/distinfo
+2-0basu/Makefile
+5-32 files

LLVM/project 7f565d7llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp AMDGPUCoExecSchedStrategy.h

Fix naming

Change-Id: Ic1b5d1c2729a20ef9810ef92f11d870ce5fa3d96
DeltaFile
+4-4llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+1-1llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.h
+5-52 files

LLVM/project de818dcllvm/lib/Analysis LoopAccessAnalysis.cpp, llvm/test/Analysis/LoopAccessAnalysis single_strided_readwrite.ll

[LAA] Speculate `stride != 0` for single read-write access

Stacked on top of https://github.com/llvm/llvm-project/pull/186262.
DeltaFile
+4-12llvm/test/Analysis/LoopAccessAnalysis/single_strided_readwrite.ll
+7-1llvm/lib/Analysis/LoopAccessAnalysis.cpp
+11-132 files

LLVM/project b4f50cflibc/src/__support/FPUtil bfloat16.h NearestIntegerOperations.h, libc/src/__support/FPUtil/generic add_sub.h

[libc][math] Qualify ceil functions to constexpr (#184948)
DeltaFile
+63-0libc/test/shared/shared_math_constexpr_test.cpp
+13-13libc/src/__support/FPUtil/generic/add_sub.h
+11-11libc/src/__support/FPUtil/bfloat16.h
+1-16libc/test/shared/shared_math_test.cpp
+8-8libc/src/__support/FPUtil/NearestIntegerOperations.h
+7-7libc/src/__support/FPUtil/comparison_operations.h
+103-5511 files not shown
+143-7917 files

LLVM/project 4f32ea3flang-rt/lib/cuda registration.cpp, flang/include/flang/Runtime/CUDA registration.h

[flang][cuda] Fix const mismatch in CUFRegisterManagedVariable for __cudaRegisterManagedVar (#188142)

Change varName parameter from `const char *` to `char *` in
CUFRegisterManagedVariable to match the CUDA runtime API signature of
__cudaRegisterManagedVar, which declares deviceAddress as `char *`.
DeltaFile
+1-1flang-rt/lib/cuda/registration.cpp
+1-1flang/include/flang/Runtime/CUDA/registration.h
+2-22 files

LLVM/project 7623c02llvm/lib/Target/AArch64 AArch64ISelDAGToDAG.cpp

fixup! Reuse SelectMultiVectorLuti()
DeltaFile
+6-36llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
+6-361 files

LLVM/project 157f6c0llvm/lib/Target/AArch64 AArch64ISelDAGToDAG.cpp

fixup! Add overloaded AArch64DAGToDAGISel::EmitMultiVectorLutiLane() for reuse
DeltaFile
+39-50llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
+39-501 files

LLVM/project 8b29814clang/lib/Sema SemaARM.cpp, clang/test/CodeGen/AArch64/sme2p3-intrinsics acle_sme2p3_luti6.c

fixup! Address more PR comments
DeltaFile
+21-3clang/lib/Sema/SemaARM.cpp
+4-12llvm/include/llvm/IR/IntrinsicsAArch64.td
+12-4clang/test/CodeGen/AArch64/sme2p3-intrinsics/acle_sme2p3_luti6.c
+6-6clang/test/CodeGen/AArch64/sve2p3-intrinsics/acle_sve2p3_luti6.c
+9-0clang/test/Sema/aarch64-sve2p3-intrinsics/acle_sve2p3_target_lane.c
+2-3llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
+54-284 files not shown
+59-3410 files

LLVM/project 74120b0llvm/lib/Target/AArch64 AArch64InstrInfo.td AArch64SVEInstrInfo.td

fixup! Address PR comments
DeltaFile
+1-10llvm/lib/Target/AArch64/AArch64InstrInfo.td
+2-2llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
+3-122 files

LLVM/project c522432clang/lib/Basic/Targets AArch64.cpp, clang/test/Preprocessor aarch64-target-features.c

fixup! Address PR comments
DeltaFile
+18-12llvm/lib/Target/AArch64/SVEInstrFormats.td
+0-29clang/lib/Basic/Targets/AArch64.cpp
+0-23clang/test/Preprocessor/aarch64-target-features.c
+13-9llvm/lib/Target/AArch64/SMEInstrFormats.td
+1-6llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
+1-4llvm/lib/Target/AArch64/AArch64SMEInstrInfo.td
+33-831 files not shown
+33-857 files

LLVM/project 38ca9b5clang/test/CodeGen/AArch64/sme2p3-intrinsics acle_sme2p3_luti6.c, clang/test/CodeGen/AArch64/sve2p3-intrinsics acle_sve2p3_luti6.c

[AArch64][clang][llvm] Add support for Armv9.7-A lookup table intrinsics

Add support for the following Armv9.7-A Lookup Table (lut)
instruction intrinsics:

SVE2.3
```c
  // Variant is  also available for: _u8 _mf8
  svint8_t svluti6[_s8](svint8x2_t table, svuint8_t indices);
```

SVE2.3 and SME2.3
``` c
  // Variants are also available for _u16_x2 and _f16_x2.
  svint16_t svluti6_lane[_s16_x2](svint16x2_t table, svuint8_t indices, uint64_t imm_idx);
```

SME2.3
```c

    [9 lines not shown]
DeltaFile
+175-0clang/test/CodeGen/AArch64/sme2p3-intrinsics/acle_sme2p3_luti6.c
+112-0clang/test/CodeGen/AArch64/sve2p3-intrinsics/acle_sve2p3_luti6.c
+105-0llvm/test/CodeGen/AArch64/sme2p3-intrinsics-luti6.ll
+102-0llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
+79-0llvm/test/Verifier/AArch64/luti6-intrinsics.ll
+55-0llvm/test/CodeGen/AArch64/sve2p3-intrinsics-luti6.ll
+628-016 files not shown
+876-322 files

NetBSD/pkgsrc-wip c326222. Makefile, wlrctl Makefile distinfo

Import wlrctl
DeltaFile
+22-0wlrctl/Makefile
+5-0wlrctl/distinfo
+5-0wlrctl/DESCR
+4-0wlrctl/PLIST
+1-0Makefile
+37-05 files

LLVM/project 71a0b9fllvm/lib/Analysis LoopAccessAnalysis.cpp, llvm/test/Analysis/LoopAccessAnalysis single_strided_readwrite.ll

[LAA] Allow vectorizing `A[NonZeroNonConstantStride*I] += 1`

In this patch only do that when we can statically prove that
non-constant stride is non-zero and the resulting index doesn't
overflow. That can later be extended to introduce run-time check when
not provable in compile-time.

My main motivation for this is to move unit-strideness speculation to a
VPlan-based transformation. However, it cannot be done right now because
sometimes such speculation affects legality and we simply avoid
vectorizing loop if it's not done. As such, we need to extend LAA to
properly support dependence analysis/RT checks for strided access
without speculating for it being one. This PR is expected to be the
first one on that journey.
DeltaFile
+83-12llvm/test/Analysis/LoopAccessAnalysis/single_strided_readwrite.ll
+19-6llvm/lib/Analysis/LoopAccessAnalysis.cpp
+102-182 files

LLVM/project 6f66cdbllvm/lib/Analysis LoopAccessAnalysis.cpp

[NFCI][LAA] Add `getPtrStrideScev` function
DeltaFile
+68-50llvm/lib/Analysis/LoopAccessAnalysis.cpp
+68-501 files

LLVM/project dae0ec3llvm/test/Analysis/LoopAccessAnalysis single_strided_readwrite.ll

[NFC][LAA] Add a test for a single strided-read-write access
DeltaFile
+243-0llvm/test/Analysis/LoopAccessAnalysis/single_strided_readwrite.ll
+243-01 files

LLVM/project 3e3b909llvm/lib/CodeGen/GlobalISel CombinerHelper.cpp, llvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp

[AMDGPU][DAGCombiner][GlobalISel] Extend allMulUsesCanBeContracted with FMA/FMAD pattern

Add conservative FMA/FMAD recognition to allMulUsesCanBeContracted:
a multiply used by an existing FMA/FMAD is assumed to be contractable
(it's already being contracted elsewhere). This avoids unnecessary
contraction blocking for multiplies that feed into FMA chains.

Also adds FMA/FMAD to the FPEXT user set (fpext(fmul) --> fma is
recognized as contractable when isFPExtFoldable).

Guards all remaining FMA-chain reassociation fold sites in both
SDAG (visitFADDForFMACombine/visitFSUBForFMACombine, 8 sites) and
GISel (matchCombineFAddFpExtFMulToFMadOrFMAAggressive, 4 sites).

This re-enables contractions that were conservatively blocked in
earlier patches where the multiply had an FMA use that wasn't yet
recognized: dagcombine-fma-crash.ll and dagcombine-fma-fmad.ll
CHECK lines revert to upstream behavior.

Co-Authored-By: Claude Opus 4.6 <noreply at anthropic.com>
DeltaFile
+95-96llvm/test/CodeGen/AMDGPU/dagcombine-fma-fmad.ll
+20-3llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+10-12llvm/test/CodeGen/AMDGPU/dagcombine-fma-crash.ll
+17-2llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
+142-1134 files

LLVM/project 24087e3llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.h AMDGPUCoExecSchedStrategy.cpp

Claude Code Review + Naming

Change-Id: I0a8d2db1afc560fd23c009cf109f3e9929d324be
DeltaFile
+6-5llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.h
+4-4llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+10-92 files

LLVM/project f34cf3fllvm/include/llvm/CodeGen/GlobalISel CombinerHelper.h, llvm/lib/CodeGen/GlobalISel CombinerHelper.cpp

[AMDGPU][DAGCombiner][GlobalISel] Extend allMulUsesCanBeContracted with FPEXT pattern

Extend the allMulUsesCanBeContracted analysis to recognize FPEXT patterns
where the multiply result flows through fpext before being used in
contractable operations (fadd, fsub). This covers:
  - fmul --> fpext --> {fadd, fsub}: FPEXT folds if isFPExtFoldable
  - fmul --> fpext --> fneg --> fsub: FPEXT then FNEG to FSUB
  - fmul --> fneg --> fpext --> fsub: FNEG then FPEXT folds if foldable

Also adds allMulUsesCanBeContracted guards to all FPEXT fold sites in
both SDAG (visitFADDForFMACombine, visitFSUBForFMACombine) and GISel
(matchCombineFAddFpExtFMulToFMadOrFMA, matchCombineFSubFpExtFMulToFMadOrFMA,
matchCombineFSubFpExtFNegFMulToFMadOrFMA).

Fixes a missing isFPExtFoldable check in GISel's
matchCombineFSubFpExtFMulToFMadOrFMA which could fold without verifying
the extension is actually foldable.

Co-Authored-By: Claude Opus 4.6 <noreply at anthropic.com>
DeltaFile
+1,930-11llvm/test/CodeGen/AMDGPU/fma-multiple-uses-contraction.ll
+93-14llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
+78-13llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+2-1llvm/include/llvm/CodeGen/GlobalISel/CombinerHelper.h
+2,103-394 files

LLVM/project 0f3fc35llvm/lib/CodeGen/GlobalISel CombinerHelper.cpp, llvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp

[AMDGPU][DAGCombiner][GlobalISel] Extend allMulUsesCanBeContracted with FNEG pattern

Extend allMulUsesCanBeContracted() to recognize fmul -> fneg -> fsub
chains as contractable uses. This allows FMA contraction when a multiply
feeds an fneg that is only used by fsub operations.

Changes:
- DAGCombiner.cpp: Add ISD::FNEG case to allMulUsesCanBeContracted()
  checking that all FNEG users are ISD::FSUB. Update 1 fold site guard
  in visitFSUBForFMACombine (fsub(fneg(fmul))).
- CombinerHelper.cpp: Add G_FNEG case to allMulUsesCanBeContracted()
  checking that all FNEG users are G_FSUB. Update 2 fold site guards
  in matchCombineFSubFNegFMulToFMadOrFMA. Fix guard ordering to check
  isContractableFMul before allMulUsesCanBeContracted (cheap first).
- Add 7 new test functions to fma-multiple-uses-contraction.ll covering
  fneg single-use, multi-use, mixed contractable/non-contractable, and
  cross-pattern (P1 direct + P2 fneg) interactions.
- Update mad-combine.ll CHECK lines affected by the guard changes.


    [4 lines not shown]
DeltaFile
+666-0llvm/test/CodeGen/AMDGPU/fma-multiple-uses-contraction.ll
+33-7llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
+22-2llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+4-5llvm/test/CodeGen/AMDGPU/mad-combine.ll
+725-144 files

LLVM/project e6ba1b4llvm/lib/CodeGen/GlobalISel CombinerHelper.cpp, llvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp

[DAGCombiner][GlobalISel] Prevent FMA contraction when fmul cannot be eliminated (FADD/FSUB pattern)

fmul nodes with multiple uses can currently be contracted into FMA
operations even when the fmul itself cannot be eliminated, resulting in
a redundant multiply (wasted power and compute). The existing guard
`Aggressive || N0->hasOneUse()` allows contraction under Aggressive mode
regardless of whether the multiply can be removed.

This patch tightens the guard to:
  `N0->hasOneUse() || (Aggressive && allMulUsesCanBeContracted(N0))`

`allMulUsesCanBeContracted()` iterates all users of the multiply and
returns true only if every use is itself contractable into an FMA.
For this first patch, only direct FADD and FSUB uses are recognized as
contractable (FNEG, FPEXT, and FMA/FMAD patterns follow in subsequent
patches).

The change is applied symmetrically to both DAGCombiner and GlobalISel:
- DAGCombiner: 4 fold sites in visitFADDForFMACombine (2 sites) and

    [8 lines not shown]
DeltaFile
+835-0llvm/test/CodeGen/AMDGPU/fma-multiple-uses-contraction.ll
+115-148llvm/test/CodeGen/AMDGPU/fma.f16.ll
+95-94llvm/test/CodeGen/AMDGPU/dagcombine-fma-fmad.ll
+61-26llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+25-25llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-pow-codegen.ll
+43-5llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
+1,174-2987 files not shown
+1,217-33513 files

LLVM/project 77fb848llvm/lib/Transforms/Vectorize LoopVectorize.cpp, llvm/test/Transforms/LoopVectorize epilog-vectorization-reductions.ll

Reapply "[LV] Simplify and unify resume value handling for epilogue vec." (#187504)

This reverts commit cdaf29f84dd0abbd1f961982799059c92d76625b.

This version skips removeBranchOnConst when vectorizing the epilogue, as
it may trigger folds that remove the resume phi used as resume value
from the epilogue.

This fixes https://github.com/llvm/llvm-project/issues/187323.

Original message:
This patch tries to drastically simplify resume value handling for the
scalar loop when vectorizing the epilogue.

It uses a simpler, uniform approach for updating all resume values in
the scalar loop:

1. Create ResumeForEpilogue recipes for all scalar resume phis in the
main loop (the epilogue plan will have exactly the same scalar resume

    [23 lines not shown]
DeltaFile
+62-197llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+115-28llvm/test/Transforms/LoopVectorize/epilog-vectorization-reductions.ll
+102-4llvm/test/Transforms/LoopVectorize/AArch64/epilog-vectorization-widen-inductions.ll
+14-14llvm/test/Transforms/LoopVectorize/X86/intrinsiccost.ll
+14-14llvm/test/Transforms/LoopVectorize/AArch64/intrinsiccost.ll
+15-10llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-cost.ll
+322-26728 files not shown
+398-36234 files

LLVM/project d194afemlir/include/mlir/Transforms Passes.td

[MLIR] [Mem2Reg] [NFC] Update pass documentation (#188140)

In #185036, we added region control flow support but forgot to update
the pass declaration documentation. This NFC addresses this.
DeltaFile
+2-5mlir/include/mlir/Transforms/Passes.td
+2-51 files

FreeBSD/src 73f72casys/netinet tcp_var.h

tcp: make TCP_TRK_TRACK_FLG_* flags visible to userland programs

These flags are used in BBLog entries.
DeltaFile
+9-9sys/netinet/tcp_var.h
+9-91 files

LLVM/project 20e863dutils/bazel/llvm-project-overlay/libc BUILD.bazel

[Bazel] Fixes 599d74f (#188138)

This fixes 599d74fdc3cee5fa1ed84d6e10830f9aa5fe6b53.
DeltaFile
+2-0utils/bazel/llvm-project-overlay/libc/BUILD.bazel
+2-01 files

LLVM/project b31dcefllvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp AMDGPUCoExecSchedStrategy.h, llvm/test/CodeGen/AMDGPU coexec-scheduler.ll

Review comments + fix test

Change-Id: I8d9d3fe7394997540d0257d8e4859b7ed811c3de
DeltaFile
+56-52llvm/test/CodeGen/AMDGPU/coexec-scheduler.ll
+11-11llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+2-6llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.h
+69-693 files

LLVM/project bcfe1e9clang/lib/Sema SemaHLSL.cpp SemaCast.cpp, clang/test/CodeGenHLSL/BasicFeatures AggregateSplatCast.hlsl

[HLSL] Allow 1x1 matrices to be splatted like scalars (#188119)

Fixes #186859 by allowing 1x1 matrices to be splatted like the scalar
and vec1 cases.

Assisted-by: GitHub Copilot (powered by Claude Opus 4.6)
DeltaFile
+33-0clang/test/CodeGenHLSL/BasicFeatures/AggregateSplatCast.hlsl
+9-4clang/lib/Sema/SemaHLSL.cpp
+6-0clang/lib/Sema/SemaCast.cpp
+48-43 files

FreeBSD/ports a2a73aebiology/wfa2-lib distinfo pkg-plist

biology/wfa2-lib: Update to 2.3.6

Bug fix release
Changes: https://github.com/smarco/WFA2-lib/commits/main/

Reported by:    portscout
DeltaFile
+3-3biology/wfa2-lib/distinfo
+2-0biology/wfa2-lib/pkg-plist
+1-1biology/wfa2-lib/Makefile
+6-43 files

LLVM/project 4720f9clibclc/clc/lib/generic/conversion clc_convert_float.inc clc_convert_float2float.cl

libclc: Use nextup and nextdown in place of nextafter

Unfortunately it seems the optimizer isn't able to clean this
up, so this is a code quality improvement.
DeltaFile
+3-4libclc/clc/lib/generic/conversion/clc_convert_float.inc
+2-0libclc/clc/lib/generic/conversion/clc_convert_float2float.cl
+2-0libclc/clc/lib/generic/conversion/clc_convert_int2float.cl
+7-43 files

LLVM/project 599d74flibc/src/__support/math acoshf_utils.h CMakeLists.txt, libc/test/src/math atanhf_test.cpp asinhf_test.cpp

[libc][math] Reduce memory usage for single precision inverse hyperbolic functions for LIBC_MATH_HAS_SMALL_TABLES option. (#188110)
DeltaFile
+54-2libc/src/__support/math/acoshf_utils.h
+9-2libc/test/src/math/atanhf_test.cpp
+2-2libc/test/src/math/asinhf_test.cpp
+1-1libc/test/src/math/acoshf_test.cpp
+2-0libc/src/__support/math/CMakeLists.txt
+68-75 files

FreeBSD/ports f0dc3edmisc/ggml pkg-plist distinfo, misc/ggml/files patch-19504

misc/ggml: update 0.9.7 → 0.9.8

PR:     293988
DeltaFile
+0-563misc/ggml/files/patch-19504
+5-4misc/ggml/pkg-plist
+3-3misc/ggml/distinfo
+3-2misc/ggml/Makefile
+11-5724 files