LLVM / project - FreshBSD

LLVM/project 5759a3a — clang/test/CodeGenOpenCL builtins-amdgcn-gfx1250.cl, llvm/lib/Target/AMDGPU SIISelLowering.cpp SOPInstructions.td

2025-12-10 08:45:13 UTC by Mirko Brkušanin via GitHub on ⎇

main

[AMDGPU] Add s_wakeup_barrier instruction for gfx1250 (#170501)

Delta		File
+41	-0	llvm/test/CodeGen/AMDGPU/s-wakeup-barrier.ll
+25	-4	llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+15	-0	clang/test/CodeGenOpenCL/builtins-amdgcn-gfx1250.cl
+14	-0	llvm/lib/Target/AMDGPU/SOPInstructions.td
+14	-0	llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
+12	-0	llvm/test/MC/AMDGPU/gfx1250_asm_sop1.s
+121	-4	12 files not shown
+160	-11	18 files

LLVM/project 804e768 — llvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp, llvm/test/CodeGen/X86 avgfloors-scalar.ll avgflooru-scalar.ll

2025-12-10 08:44:11 UTC by Simon Pilgrim via GitHub on ⎇

main

[DAG] Recognise AVGFLOOR (((A >> 1) + (B >> 1)) + (A & B & 1)) patterns (#169644)

Recognise 'LSB' style AVGFLOOR patterns.

Alive2:
[https://alive2.llvm.org/ce/z/nfSSk_](https://alive2.llvm.org/ce/z/nfSSk_)

Fixes #53648
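
The identity behind this combine can be checked numerically. What follows is a minimal Python sketch, not the DAGCombiner code; the names `avgfloor_lsb`, `avgfloor_ref` and `check_all_i8` are made up for illustration. It relies on Python's `>>` being an arithmetic (floor) shift and `//` being floor division, so one check covers both the signed and the unsigned reading of the operands.

```python
def avgfloor_lsb(a: int, b: int) -> int:
    """'LSB' style floor average: halve each operand, then add back the
    carry that is lost only when both low bits are set."""
    return ((a >> 1) + (b >> 1)) + (a & b & 1)


def avgfloor_ref(a: int, b: int) -> int:
    """Reference: floor((a + b) / 2) on unbounded Python integers."""
    return (a + b) // 2


def check_all_i8() -> bool:
    """Exhaustively compare both forms over every pair of 8-bit values,
    unsigned (0..255) and signed (-128..127)."""
    for lo, hi in ((0, 256), (-128, 128)):
        for a in range(lo, hi):
            for b in range(lo, hi):
                if avgfloor_lsb(a, b) != avgfloor_ref(a, b):
                    return False
    return True
```

The LSB form never needs a value wider than the operands, whereas the plain `a + b` can overflow a fixed-width type before it is halved.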

Delta		File
+36	-60	llvm/test/CodeGen/X86/avgfloors-scalar.ll
+30	-66	llvm/test/CodeGen/X86/avgflooru-scalar.ll
+15	-4	llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+81	-130	3 files

LLVM/project a108881 — llvm/lib/Target/LoongArch LoongArchISelLowering.cpp LoongArchLSXInstrInfo.td, llvm/test/CodeGen/LoongArch/lasx issue170976.ll

2025-12-10 08:39:29 UTC by hev via GitHub on ⎇

main

[LoongArch] Custom lowering for vector logical right shifts of integers (#171097)

After PR #169491, the DAG combiner can still recreate vector UDIV in an
illegal type even after type legalization, which is the root cause of
issue #170976.

The optimization introduced in PR #169491 is still desirable, so this
patch adds custom lowering for vector integer logical right shifts to
prevent the DAG from producing nodes with illegal types.

Fixes #170976

Delta		File
+74	-0	llvm/test/CodeGen/LoongArch/lasx/issue170976.ll
+74	-0	llvm/test/CodeGen/LoongArch/lsx/issue170976.ll
+45	-2	llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
+25	-11	llvm/lib/Target/LoongArch/LoongArchLSXInstrInfo.td
+9	-18	llvm/test/CodeGen/LoongArch/lasx/ir-instruction/avg.ll
+9	-18	llvm/test/CodeGen/LoongArch/lsx/ir-instruction/avg.ll
+236	-49	4 files not shown
+251	-60	10 files

LLVM/project 9039721 — llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

2025-12-10 08:28:50 UTC by Diana Picus via GitHub on ⎇

users/rovka/alloc-vgpr-intrinsic

Merge branch 'main' into users/rovka/alloc-vgpr-intrinsic

Delta		File
+17,522	-20,773	llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998	-11,093	llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981	-11,098	llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+5,975	-8,879	llvm/test/CodeGen/AMDGPU/shufflevector.v4i64.v4i64.ll
+5,975	-8,879	llvm/test/CodeGen/AMDGPU/shufflevector.v4p0.v4p0.ll
+7,387	-7,087	llvm/test/tools/llvm-dwarfdump/X86/simplified-template-names.s
+54,838	-67,809	8,748 files not shown
+511,427	-454,059	8,754 files

LLVM/project f5e6cb0 — llvm/test/tools/llvm-mca/AArch64/Neoverse V3AE-neon-instructions.s V2-neon-instructions.s

2025-12-10 08:26:30 UTC by Diana Picus via GitHub on ⎇

users/rovka/relax-callers-for-chain-funcs

Merge branch 'main' into users/rovka/relax-callers-for-chain-funcs

Delta		File
+780	-1,347	llvm/test/tools/llvm-mca/AArch64/Neoverse/V3AE-neon-instructions.s
+780	-1,347	llvm/test/tools/llvm-mca/AArch64/Neoverse/V2-neon-instructions.s
+780	-1,347	llvm/test/tools/llvm-mca/AArch64/Neoverse/V3-neon-instructions.s
+1,000	-1,084	llvm/test/tools/llvm-mca/AArch64/Neoverse/N2-neon-instructions.s
+1,000	-1,084	llvm/test/tools/llvm-mca/AArch64/Neoverse/N3-neon-instructions.s
+1,000	-1,084	llvm/test/tools/llvm-mca/AArch64/Neoverse/N1-neon-instructions.s
+5,340	-7,293	1,433 files not shown
+42,274	-31,004	1,439 files

LLVM/project 62dbe57 — compiler-rt/lib/sanitizer_common sanitizer_linux.cpp sanitizer_platform_limits_posix.h

2025-12-10 08:09:54 UTC by Brad Smith via GitHub on ⎇

main

[compiler-rt][sanitizer] fix i386 build for Haiku (#171075)

r13 does not provide the trap err.

Co-authored-by: Jerome Duval <jerome.duval at gmail.com>

Delta		File
+10	-2	compiler-rt/lib/sanitizer_common/sanitizer_linux.cpp
+1	-1	compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_posix.h
+11	-3	2 files

LLVM/project 56e092c — llvm/lib/Target/RISCV/GISel RISCVLegalizerInfo.cpp, llvm/test/CodeGen/RISCV/GlobalISel fp-fcanonicalize.ll legalizer-info-validation.mir

2025-12-10 07:45:01 UTC by Chuan-Yue Yuan via GitHub on ⎇

main

[RISCV][GISel] Legalize the G_FCANONICALIZE instruction (#166162)

Delta		File
+66	-0	llvm/test/CodeGen/RISCV/GlobalISel/fp-fcanonicalize.ll
+5	-0	llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
+2	-2	llvm/test/CodeGen/RISCV/GlobalISel/legalizer-info-validation.mir
+73	-2	3 files

LLVM/project 885c344 — mlir/include/mlir/Dialect/OpenMP OpenMPClauses.td OpenMPOps.td, mlir/lib/Dialect/OpenMP/IR OpenMPDialect.cpp

2025-12-10 07:44:02 UTC by skc7 on ⎇

users/skc7/omp_teams_multidim

use dims modifier in main num_teams clause itself instead of creating new clause

Delta		File
+161	-51	mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
+48	-80	mlir/include/mlir/Dialect/OpenMP/OpenMPClauses.td
+76	-1	mlir/test/Dialect/OpenMP/invalid.mlir
+2	-2	mlir/test/Dialect/OpenMP/ops.mlir
+1	-2	mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
+288	-136	5 files

LLVM/project 43f0b69 — clang/include/clang/Basic Builtins.def, clang/lib/AST ASTContext.cpp

2025-12-10 07:42:12 UTC by Rana Pratap Reddy via GitHub on ⎇

main

[AMDGPU] Removal of language sensitive option for _Float16 and half('e') handling (#170443)

Remove the 'e' handling for the amdgcn builtins, as we decided to use
_Float16 for both HIP/C++ and OpenCL.

Delta		File
+2	-6	clang/lib/AST/ASTContext.cpp
+0	-1	clang/include/clang/Basic/Builtins.def
+2	-7	2 files

LLVM/project bd9651b — mlir/include/mlir/Bindings/Python NanobindAdaptors.h, mlir/test/python/dialects pdl_types.py

2025-12-10 07:37:45 UTC by Oleksandr "Alex" Zinenko via GitHub on ⎇

main

[mlir][py] avoid crashing on None contexts in custom `get`s (#171140)

Following a series of refactorings, MLIR Python bindings would crash if a
dialect object requiring a context defined using mlir_attribute/type_subclass
was constructed outside of the `ir.Context` context manager. The type caster
for `MlirContext` would try using `ir.Context.current` when the default `None`
value was provided to the `get`, which would also just return `None`. The
caster would then attempt to obtain the MLIR capsule for that `None`, fail,
but access it anyway without checking, leading to a C++ assertion failure or
segfault.

Guard against this case in nanobind adaptors. Also emit a warning to the

    [13 lines not shown]

Delta		File
+14	-6	mlir/include/mlir/Bindings/Python/NanobindAdaptors.h
+13	-0	mlir/test/python/dialects/pdl_types.py
+27	-6	2 files

LLVM/project b939293 — llvm/lib/Target/LoongArch LoongArchISelLowering.cpp

2025-12-10 07:24:34 UTC by WANG Rui on ⎇

users/hev/issue-170976

Address weining's comments

Delta		File
+6	-10	llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
+6	-10	1 file

LLVM/project 857b68f — llvm/include/llvm/MC MCInstrDesc.h, llvm/test/TableGen target-specialized-pseudos.td RegClassByHwMode.td

2025-12-10 07:17:58 UTC by Jay Foad via GitHub on ⎇

main

[MC] Reorder TARGETInstrTable to shrink MCInstrDesc::ImplicitOffset (#171199)

Put ImplicitOps[] before OperandInfo[] in the generated
TARGETInstrTable. This means that offsets to entries into the (small)
ImplicitOps table do not need to skip over the (large) OperandInfo
table.

This allows shrinking ImplicitOffset from 32 bits to 16 bits
(effectively reverting #138127) which will allow expanding Opcode
instead without increasing the size of MCInstrDesc.
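
The effect of the reordering on offset width can be illustrated with a small Python sketch. It assumes the two tables sit back to back in one array and that offsets count entries from the start of that array; the function names and the table sizes used below are invented for illustration and are not taken from TableGen output.

```python
def max_offset_into_implicit_ops(operand_info_len: int,
                                 implicit_ops_len: int,
                                 implicit_first: bool) -> int:
    """Largest offset needed to address an entry of the ImplicitOps table
    when both tables live back to back and offsets count from the start."""
    base = 0 if implicit_first else operand_info_len
    return base + implicit_ops_len - 1


def fits_in_bits(value: int, bits: int) -> bool:
    """True if `value` is representable as an unsigned `bits`-wide integer."""
    return 0 <= value < (1 << bits)
```

With a hypothetical 100,000-entry OperandInfo table and a 2,000-entry ImplicitOps table, the largest ImplicitOps offset is 101,999 when OperandInfo comes first (too big for 16 bits) and 1,999 when ImplicitOps comes first.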

Delta		File
+30	-16	llvm/utils/TableGen/InstrInfoEmitter.cpp
+6	-6	llvm/test/TableGen/target-specialized-pseudos.td
+1	-1	llvm/test/TableGen/RegClassByHwMode.td
+1	-1	llvm/include/llvm/MC/MCInstrDesc.h
+38	-24	4 files

LLVM/project 68db47a — mlir/include/mlir/Bindings/Python NanobindAdaptors.h, mlir/test/python/dialects pdl_types.py

2025-12-10 07:17:17 UTC by Alex Zinenko on ⎇

users/ftynse/mlir-none-context-crash

[mlir][py] avoid crashing on None contexts in custom `get`s

Following a series of refactorings, MLIR Python bindings would crash if a
dialect object requiring a context defined using mlir_attribute/type_subclass
was constructed outside of the `ir.Context` context manager. The type caster
for `MlirContext` would try using `ir.Context.current` when the default `None`
value was provided to the `get`, which would also just return `None`. The
caster would then attempt to obtain the MLIR capsule for that `None`, fail,
but access it anyway without checking, leading to a C++ assertion failure or
segfault.

Guard against this case in nanobind adaptors. Also emit a warning to the user
to clarify expectations, as the default message confusingly says that `None` is
accepted as context and then fails with a type error. Using Python C API is
currently recommended by nanobind in this case since the surrounding function
must be marked `noexcept`.

The corresponding test is in the PDL dialect since it is where I first observed
the behavior. Core types are not using the `mlir_type_subclass` mechanism and
are immune to the problem, so cannot be used for checking.
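
The guard described above can be sketched in plain Python. This is only an illustration of the pattern (check for `None` before use, warn, then raise a type error); the `Context` class and `resolve_context` function are hypothetical stand-ins, not the nanobind adaptor code or the real `ir.Context`.

```python
import warnings
from typing import Optional


class Context:
    """Stand-in for a context object; not the real `ir.Context`."""
    current: Optional["Context"] = None  # no context manager is active


def resolve_context(context: Optional[Context] = None) -> Context:
    """Return the explicit context, else the current one; fail loudly
    instead of dereferencing `None` when neither is available."""
    if context is not None:
        return context
    if Context.current is None:
        warnings.warn("no context passed and no current context is active",
                      RuntimeWarning)
        raise TypeError("a context is required but none is available")
    return Context.current
```

Calling `resolve_context()` with no active context raises `TypeError` rather than touching an invalid object, which is the behavior change the message describes at the C++ level.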

Delta		File
+14	-6	mlir/include/mlir/Bindings/Python/NanobindAdaptors.h
+13	-0	mlir/test/python/dialects/pdl_types.py
+27	-6	2 files

LLVM/project 1c7126d — llvm/lib/Transforms/Vectorize VPlan.h VPlanValue.h

2025-12-10 07:09:21 UTC by Ramkumar Ramachandra via GitHub on ⎇

main

[VPlan] Combine LiveIns fields into MapVector (NFC) (#170220)

Combine Value2VPValue and VPLiveIns into a single MapVector LiveIns
field, simplifying users.
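
As a rough analogy for what merging the two fields buys, here is a Python sketch. Python's `dict` preserves insertion order, which makes it a reasonable stand-in for an insertion-ordered map such as MapVector; the class names and the `vp<...>` strings are hypothetical and do not mirror the VPlan API.

```python
from typing import Dict, Hashable, List


class TwoFieldLiveIns:
    """Before: a lookup map plus a separate list that preserves order."""

    def __init__(self) -> None:
        self.value_to_vpvalue: Dict[Hashable, str] = {}
        self.live_ins: List[str] = []

    def get_or_add(self, value: Hashable) -> str:
        if value not in self.value_to_vpvalue:
            vpv = f"vp<{value}>"
            self.value_to_vpvalue[value] = vpv
            self.live_ins.append(vpv)  # must be kept in sync by hand
        return self.value_to_vpvalue[value]

    def ordered(self) -> List[str]:
        return list(self.live_ins)


class SingleFieldLiveIns:
    """After: one insertion-ordered map serves lookup and iteration."""

    def __init__(self) -> None:
        self.live_ins: Dict[Hashable, str] = {}

    def get_or_add(self, value: Hashable) -> str:
        return self.live_ins.setdefault(value, f"vp<{value}>")

    def ordered(self) -> List[str]:
        return list(self.live_ins.values())
```

Both classes return the same values in the same order; the single-field version simply has no second container to keep consistent.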

Delta		File
+7	-19	llvm/lib/Transforms/Vectorize/VPlan.h
+0	-5	llvm/lib/Transforms/Vectorize/VPlanValue.h
+7	-24	2 files

LLVM/project 5489010 — flang-rt/lib/runtime assign.cpp

2025-12-10 06:57:25 UTC by Michael Klemm via GitHub on ⎇

main

[Flang-rt] Implement same behavior as -O3 for zero-length arrays (#171480)

Delta		File
+1	-1	flang-rt/lib/runtime/assign.cpp
+1	-1	1 file

LLVM/project dc8fde0 — mlir/lib/Target/LLVMIR ModuleImport.cpp, mlir/test/Target/LLVMIR/Import exception.ll constant.ll

2025-12-10 06:41:54 UTC by Bala_Bhuvan_Varma via GitHub on ⎇

main

[MLIR][LLVMIR] Fix LLVM IR import of ZeroInitializers to constant zero (#171107)

Constant zero aggregates were imported from LLVM IR as undef.
This includes, for example, LandingPad instructions with zero-valued
filters and structs.

This fixes the import to use the zeroOp to materialize a
zero-initialized constant.

Delta		File
+55	-3	mlir/test/Target/LLVMIR/Import/exception.ll
+41	-7	mlir/test/Target/LLVMIR/Import/constant.ll
+12	-13	mlir/lib/Target/LLVMIR/ModuleImport.cpp
+2	-4	mlir/test/Target/LLVMIR/Import/zeroinitializer.ll
+110	-27	4 files

LLVM/project f40c8e7 — lld/ELF Target.h

2025-12-10 06:33:01 UTC by Fangrui Song on ⎇

main

ELF: Remove stray ;. NFC

Delta		File
+1	-1	lld/ELF/Target.h
+1	-1	1 file

LLVM/project fde8dc7 — clang/include/clang/Basic BuiltinsAMDGPU.def, clang/lib/CodeGen/TargetBuiltins AMDGPU.cpp

2025-12-10 06:26:26 UTC by Aaditya on ⎇

users/easyonaadit/amdgpu/wave-reduce-intrinsics-double-builtins

[AMDGPU] Add builtins for wave reduction intrinsics

Delta		File
+84	-0	clang/test/CodeGenOpenCL/builtins-amdgcn.cl
+8	-0	clang/lib/CodeGen/TargetBuiltins/AMDGPU.cpp
+4	-0	clang/include/clang/Basic/BuiltinsAMDGPU.def
+96	-0	3 files

LLVM/project d0d9992 — llvm/lib/Target/AMDGPU SIISelLowering.cpp SIInstructions.td, llvm/test/CodeGen/AMDGPU llvm.amdgcn.reduce.fadd.ll llvm.amdgcn.reduce.fsub.ll

2025-12-10 06:26:18 UTC by Aaditya on ⎇

users/easyonaadit/amdgpu/wave-reduce-intrinsics-double-add-sub

[AMDGPU] Add wave reduce intrinsics for double types - 2

Supported Ops: `add`, `sub`

Delta		File
+1,115	-0	llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.fadd.ll
+1,102	-0	llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.fsub.ll
+80	-19	llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2	-0	llvm/lib/Target/AMDGPU/SIInstructions.td
+2,299	-19	4 files

LLVM/project efda519 — llvm/test/Transforms/LoopVectorize/AArch64 predicated-costs.ll

2025-12-10 06:13:32 UTC by Luke Lau via GitHub on ⎇

main

[LV] Use branch_weights metadata in getPredBlockCostDivisor test. NFC (#171299)

This is more reliable in the event that the trivial fcmp gets folded
away.

Delta		File
+13	-10	llvm/test/Transforms/LoopVectorize/AArch64/predicated-costs.ll
+13	-10	1 file

LLVM/project d18cdc9 — llvm/include/llvm/TargetParser RISCVTargetParser.h, llvm/lib/Target/RISCV RISCVInsertVSETVLI.cpp

2025-12-10 06:06:15 UTC by Craig Topper via GitHub on ⎇

main

[RISCVInsertVSETVLI] Don't allow getSEW/getLMUL to be called for hasSEWLMULRatioOnly(). NFC (#171554)

Refactor some logic in transferBefore to handle hasSEWLMULRatioOnly()
before calling getSEW/getLMUL.

Update adjustIncoming to use getSEWLMULRatio(). Update the interface of
RISCVVType::getSameRatioLMUL to take the ratio instead of SEW and LMUL.
Update the few other callers to call RISCVVType::getSEWLMULRatio first.
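
The interface change (pass the ratio, not SEW and LMUL) can be illustrated with exact fractions. This is a Python sketch of the arithmetic only, under the assumption that the ratio is SEW divided by LMUL and that LMUL ranges over the powers of two from 1/8 to 8; the function names are illustrative and are not the `RISCVVType` signatures.

```python
from fractions import Fraction
from typing import Optional

# Assumed set of encodable LMUL values.
VALID_LMULS = {Fraction(1, 8), Fraction(1, 4), Fraction(1, 2),
               Fraction(1), Fraction(2), Fraction(4), Fraction(8)}


def sew_lmul_ratio(sew: int, lmul: Fraction) -> int:
    """SEW/LMUL ratio, e.g. SEW=32 with LMUL=1/2 gives 64."""
    ratio = Fraction(sew) / lmul
    assert ratio.denominator == 1, "ratio is expected to be a whole number"
    return int(ratio)


def same_ratio_lmul(ratio: int, eew: int) -> Optional[Fraction]:
    """LMUL that keeps `ratio` unchanged for element width `eew`,
    or None if that LMUL is not one of the encodable values."""
    emul = Fraction(eew, ratio)
    return emul if emul in VALID_LMULS else None
```

A caller that only knows the ratio can use `same_ratio_lmul` directly; a caller that has SEW and LMUL computes the ratio first, which is the split the message describes.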

Delta		File
+21	-21	llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
+12	-6	llvm/unittests/TargetParser/RISCVTargetParserTest.cpp
+2	-1	llvm/lib/Target/RISCV/MCA/RISCVCustomBehaviour.cpp
+1	-2	llvm/lib/TargetParser/RISCVTargetParser.cpp
+1	-2	llvm/include/llvm/TargetParser/RISCVTargetParser.h
+37	-32	5 files

LLVM/project fcdac81 — llvm/lib/Target/RISCV RISCVInstrInfoF.td

2025-12-10 06:05:44 UTC by Craig Topper via GitHub on ⎇

main

[RISCV] Use frmarg instead of ixlenimm in PseudoFROUND. NFC (#171563)

This is expanded with a custom inserter and this immediate will be
copied to the frm operand of a non-pseudo instruction.

Delta		File
+1	-1	llvm/lib/Target/RISCV/RISCVInstrInfoF.td
+1	-1	1 file

LLVM/project 13d99e3 — llvm/lib/Target/RISCV RISCVSchedSiFiveP600.td

2025-12-10 05:59:13 UTC by Pengcheng Wang via GitHub on ⎇

main

[RISCV] Fix wrong use of SiFiveP400GetVLMAX in RISCVSchedSiFiveP600 (#171562)

There is no difference in functionality, and I believe this is a
copy-paste mistake. :-)

Delta		File
+1	-1	llvm/lib/Target/RISCV/RISCVSchedSiFiveP600.td
+1	-1	1 file

LLVM/project 7464b86 — llvm/lib/Target/AArch64 AArch64Features.td AArch64Processors.td, llvm/test/CodeGen/AArch64 sve-st1-addressing-mode-reg-imm.ll sve-ld1-addressing-mode-reg-imm.ll

2025-12-10 05:56:08 UTC by Kinoshita Kotaro via GitHub on ⎇

main

[AArch64][SVE] Add SubtargetFeature to disable lowering unpredicated loads/stores as LDR/STR (#170256)

PR #127837 changed the lowering for unpredicated loads/stores to use LDR/STR instead of LD1/ST1.
However, on some CPUs, such as A64FX, there is a performance difference between LD1/ST1 and LDR/STR.
As a result, the lowering introduced in #127837 can cause a performance regression on these targets.
This patch adds a SubtargetFeature `disable-unpredicated-ld-st-lower` to disable this lowering.
It is enabled for the A64FX target.

Delta		File
+102	-0	llvm/test/CodeGen/AArch64/sve-st1-addressing-mode-reg-imm.ll
+100	-0	llvm/test/CodeGen/AArch64/sve-ld1-addressing-mode-reg-imm.ll
+4	-0	llvm/lib/Target/AArch64/AArch64Features.td
+2	-1	llvm/lib/Target/AArch64/AArch64Processors.td
+2	-0	llvm/lib/Target/AArch64/AArch64InstrInfo.td
+1	-1	llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
+211	-2	6 files

LLVM/project 2ea1bfd — llvm/lib/Target/AMDGPU SIShrinkInstructions.cpp

2025-12-10 05:48:06 UTC by vikhegde via Vikram Hegde on ⎇

users/vikramRH/npm/npm_siShrink_changed

[AMDGPU] Make SIShrinkInstructions pass return valid changed state

Delta		File
+60	-35	llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp
+60	-35	1 file

LLVM/project 2f61427 — llvm/lib/Target/AMDGPU SIShrinkInstructions.cpp

2025-12-10 05:48:06 UTC by vikhegde via Vikram Hegde on ⎇

users/vikramRH/npm/npm_siShrink_changed

clang-format

Delta		File
+5	-3	llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp
+5	-3	1 file

LLVM/project aebab05 — llvm/include/llvm/Passes CodeGenPassBuilder.h, llvm/test/CodeGen/AMDGPU llc-pipeline-npm.ll

2025-12-10 05:45:37 UTC by Vikram Hegde via GitHub on ⎇

main

[NPM] Schedule PhysicalRegisterUsageAnalysis before RegUsageInfoCollectorPass (#168832)

RegUsageInfoCollectorPass requires PhysicalRegisterUsageAnalysis to be valid. This change is required since it's a module analysis.

Delta		File
+3	-3	llvm/test/CodeGen/AMDGPU/llc-pipeline-npm.ll
+3	-1	llvm/include/llvm/Passes/CodeGenPassBuilder.h
+6	-4	2 files

LLVM/project 570bcea — compiler-rt/lib/scudo/standalone primary64.h

2025-12-10 05:08:44 UTC by Sadaf Ebrahimi via GitHub on ⎇

main

[scudo] Add last release time info to getStats (#170902)

Knowing when the last page release happened can help us figure out if
the page release is skipped or not.

Delta		File
+15	-1	compiler-rt/lib/scudo/standalone/primary64.h
+15	-1	1 file

LLVM/project 21871bb — llvm/lib/Target/RISCV RISCVISelLowering.cpp RISCVRegisterInfo.td, llvm/test/CodeGen/RISCV/rvv pr171141.ll subregister-undef-early-clobber.mir

2025-12-10 04:59:33 UTC by Craig Topper via GitHub on ⎇

main

[RISCV] Add fractional LMUL register classes for inline assembly. (#171278)

Inline assembly uses the first type from the register class to
connect to the rest of SelectionDAG. By adding fractional LMUL
register classes, we can ensure that this type is the size of the
types we use for fractional LMUL in the rest of SelectionDAG.

This allows us to remove some of the handling we had in
splitValueIntoRegisterParts/joinRegisterPartsIntoValue. This code
was incorrectly handling v16i4 arguments/returns, which should be
any_extended to v16i8 to match type legalization. Instead we widened
v16i4 -> v32i4 then bitcasted to v16i8. This merged pairs of i4
elements into an i8 element instead of keeping them as separate
elements that have been extended to i8.

This is an alternative to #171243.
    
Fixes #171141.
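
The difference between the two lowerings of v16i4 can be made concrete with a Python sketch that models vectors as lists of small integers. It assumes little-endian element order within a byte and shows undefined bits and padding as zero; the function names are illustrative and are not LLVM APIs.

```python
from typing import List


def any_extend_v16i4_to_v16i8(elts: List[int]) -> List[int]:
    """Keep 16 separate elements; each i4 lands in the low bits of its
    own i8 (the high bits are undefined, shown here as zero)."""
    assert len(elts) == 16 and all(0 <= e < 16 for e in elts)
    return list(elts)


def widen_then_bitcast(elts: List[int]) -> List[int]:
    """Widen v16i4 to v32i4 (padding shown as zero), then reinterpret the
    same bits as v16i8: each byte now holds two neighbouring i4 elements.
    Assumes little-endian element order within a byte."""
    assert len(elts) == 16 and all(0 <= e < 16 for e in elts)
    widened = list(elts) + [0] * 16
    return [widened[2 * i] | (widened[2 * i + 1] << 4) for i in range(16)]
```

For the input `[0, 1, ..., 15]`, the any-extend form keeps sixteen distinct byte elements, while the widen-and-bitcast form packs them into the first eight bytes (`0x10, 0x32, ...`), which is the element merging the message identifies as the bug.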

Delta		File
+27	-24	llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+18	-1	llvm/lib/Target/RISCV/RISCVRegisterInfo.td
+12	-0	llvm/test/CodeGen/RISCV/rvv/pr171141.ll
+2	-2	llvm/test/CodeGen/RISCV/rvv/subregister-undef-early-clobber.mir
+59	-27	4 files

LLVM/project 84e4a4c — clang/include/clang/Basic Builtins.def, clang/lib/AST ASTContext.cpp

2025-12-10 04:59:23 UTC by ranapratap55 on ⎇

users/ranapratap55/Removal-of-e-handling

[AMDGPU] Removal of language sensitive option for _Float16 and half('e') handling

Delta		File
+2	-6	clang/lib/AST/ASTContext.cpp
+0	-1	clang/include/clang/Basic/Builtins.def
+2	-7	2 files