LLVM/project a0b6638llvm/lib/Target/RISCV RISCVTargetTransformInfo.cpp, llvm/test/Transforms/LoopUnroll/RISCV vector.ll

[RISCV] Don't unroll vectorized loops with vector operands (#171089)

We have disabled unrolling for vectorized loops in #151525 but this
PR only checked the instruction type.

For some loops, there is no instruction with vector type but they
are still vector operations (just like the memset zero test in the
precommit test).

Here we check the operands as well to cover these cases.
DeltaFile
+17-98llvm/test/Transforms/LoopUnroll/RISCV/vector.ll
+4-1llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp
+21-992 files

LLVM/project abe7ec4clang/test/Headers __clang_hip_math.hip, llvm/lib/Transforms/Scalar StraightLineStrengthReduce.cpp

fix missing newline

Created using spr 1.3.8-beta.1
DeltaFile
+2,027-185llvm/test/CodeGen/X86/shift-i512.ll
+1,563-413llvm/test/CodeGen/X86/bitcnt-big-integer.ll
+825-755clang/test/Headers/__clang_hip_math.hip
+883-273llvm/lib/Transforms/Scalar/StraightLineStrengthReduce.cpp
+2-668llvm/test/CodeGen/RISCV/rv32p.ll
+323-320llvm/test/CodeGen/AMDGPU/llvm.exp10.ll
+5,623-2,614984 files not shown
+18,739-13,559990 files

LLVM/project 780b0feclang/test/Headers __clang_hip_math.hip, llvm/lib/Transforms/Scalar StraightLineStrengthReduce.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.8-beta.1

[skip ci]
DeltaFile
+2,027-185llvm/test/CodeGen/X86/shift-i512.ll
+1,563-413llvm/test/CodeGen/X86/bitcnt-big-integer.ll
+825-755clang/test/Headers/__clang_hip_math.hip
+883-273llvm/lib/Transforms/Scalar/StraightLineStrengthReduce.cpp
+2-668llvm/test/CodeGen/RISCV/rv32p.ll
+323-320llvm/test/CodeGen/AMDGPU/llvm.exp10.ll
+5,623-2,614983 files not shown
+18,734-13,554989 files

LLVM/project e5a1821clang/include/clang/Basic OpenCLExtensions.def, clang/test/SemaOpenCL extension-version.cl

[OpenCL] Add missing Intel extensions to OpenCLExtensions.def (#169875)

Add following extensions:
cl_intel_bfloat16_conversion
cl_intel_subgroup_buffer_prefetch
cl_intel_subgroup_local_block_io
cl_intel_subgroups_char
cl_intel_subgroups_long

This allows targets to expose these extensions via
getSupportedOpenCLOpts and ensures macros are defined when enabled.
DeltaFile
+36-0clang/test/SemaOpenCL/extension-version.cl
+5-0clang/include/clang/Basic/OpenCLExtensions.def
+41-02 files

LLVM/project a859ff9llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll shift-i512.ll

Merge branch 'main' into broadcastpropagation
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,857-10,952llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,840-10,957llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,091-0llvm/test/CodeGen/AMDGPU/atomicrmw_usub_sat.ll
+1,541-1,541llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+2,027-185llvm/test/CodeGen/X86/shift-i512.ll
+42,878-44,4082,285 files not shown
+129,195-93,1652,291 files

LLVM/project bf8558emlir/include/mlir/Dialect/XeGPU/IR XeGPUAttrs.td, mlir/lib/Dialect/XeGPU/IR XeGPUDialect.cpp

add setUnitDimData/Layout interface and refactor
DeltaFile
+125-0mlir/lib/Dialect/XeGPU/IR/XeGPUDialect.cpp
+45-29mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp
+20-0mlir/include/mlir/Dialect/XeGPU/IR/XeGPUAttrs.td
+2-6mlir/lib/Dialect/XeGPU/Transforms/XeGPUPropagateLayout.cpp
+192-354 files

LLVM/project 621a15dllvm/test/CodeGen/Mips setcc-se.ll micromips-pseudo-mtlohi-expand.ll, llvm/test/CodeGen/Mips/llvm-ir load.ll store.ll

[𝘀𝗽𝗿] initial version

Created using spr 1.3.8-beta.1
DeltaFile
+795-795llvm/test/CodeGen/Mips/llvm-ir/load.ll
+450-450llvm/test/CodeGen/Mips/llvm-ir/store.ll
+157-157llvm/test/CodeGen/Mips/setcc-se.ll
+148-148llvm/test/CodeGen/Mips/llvm-ir/fptosi.ll
+66-66llvm/test/MC/Lanai/memory.s
+40-40llvm/test/CodeGen/Mips/micromips-pseudo-mtlohi-expand.ll
+1,656-1,6565 files not shown
+1,702-1,70211 files

LLVM/project 893479allvm/test/Transforms/LoopUnroll/RISCV vector.ll

[RISCV] Precommit test for unrolling loops with vector operands
DeltaFile
+210-7llvm/test/Transforms/LoopUnroll/RISCV/vector.ll
+210-71 files

LLVM/project f715356llvm/lib/Transforms/IPO LowerTypeTests.cpp, llvm/test/Transforms/LowerTypeTests import.ll

[LTT] Add `unknown` branch weights when lowering type tests with conditional
DeltaFile
+13-10llvm/test/Transforms/LowerTypeTests/import.ll
+5-1llvm/lib/Transforms/IPO/LowerTypeTests.cpp
+0-2llvm/utils/profcheck-xfail.txt
+18-133 files

LLVM/project f2ddc7bllvm/test/Transforms/LowerTypeTests import.ll

[NFC] Run UTC --check-globals on LowerTypeTests/import.ll (#170755)

DeltaFile
+154-136llvm/test/Transforms/LowerTypeTests/import.ll
+154-1361 files

LLVM/project 898963bllvm/lib/Target/AArch64 AArch64LowerHomogeneousPrologEpilog.cpp, llvm/test/CodeGen/AArch64 arm64-homogeneous-prolog-epilog-tail-call.mir

[AArch64] Fix missing register definitions in homogeneous epilog lowering (#171118)

The lowering for HOM_Epilog did not transfer explicit register defs from
the pseudo-instruction to the generated helper calls. MachineVerifier
would complain if a following tail call uses one of the restored CSRs.
This scenario occurs in code generated by the Swift compiler, where X20
is used to pass swiftself.

This patch fixes the issue by adding the missing defs back to the helper
call as implicit defs.
DeltaFile
+28-0llvm/test/CodeGen/AArch64/arm64-homogeneous-prolog-epilog-tail-call.mir
+17-10llvm/lib/Target/AArch64/AArch64LowerHomogeneousPrologEpilog.cpp
+45-102 files

LLVM/project 9fc1c49libclc/opencl/include/clc/opencl/atomic atomic_decl.inc atomic_flag_test_and_set.h, libclc/opencl/include/clc/opencl/image image_defines.h image.h

[NFC][libclc] Delete OpenCL builtin declarations (#170803)

This is follow-up of comment
https://github.com/llvm/llvm-project/pull/168318#discussion_r2588117855

libclc OpenCL library is already compiled with flag
`-fdeclare-opencl-builtins -finclude-default-header`.
DeltaFile
+0-134libclc/opencl/include/clc/opencl/atomic/atomic_decl.inc
+0-91libclc/opencl/include/clc/opencl/shared/vstore.h
+0-88libclc/opencl/include/clc/opencl/shared/vload.h
+0-62libclc/opencl/include/clc/opencl/image/image_defines.h
+0-55libclc/opencl/include/clc/opencl/image/image.h
+0-49libclc/opencl/include/clc/opencl/atomic/atomic_flag_test_and_set.h
+0-479470 files not shown
+31-5,389476 files

LLVM/project 77c75b4clang-tools-extra/docs ReleaseNotes.rst

release notes

Created using spr 1.3.7
DeltaFile
+5-0clang-tools-extra/docs/ReleaseNotes.rst
+5-01 files

LLVM/project 8c8196cllvm/lib/Target/AMDGPU SIInsertWaitcnts.cpp

[AMDGPU][NFC] cleanup whitespace in debug log of SIInsertWaitcnts
DeltaFile
+13-13llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
+13-131 files

LLVM/project 6580c09clang-tools-extra/docs/clang-tidy/checks/abseil unchecked-statusor-access.rst

doc

Created using spr 1.3.7
DeltaFile
+377-0clang-tools-extra/docs/clang-tidy/checks/abseil/unchecked-statusor-access.rst
+377-01 files

LLVM/project 1e33e12llvm/lib/Target/AMDGPU SIInsertWaitcnts.cpp

[AMDGPU][NFC] fix function names in debug log for SIInsertWaitcnts
DeltaFile
+4-4llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
+4-41 files

LLVM/project d020c4bclang-tools-extra/test/clang-tidy/checkers/abseil abseil-unchecked-statusor-access.cpp, clang-tools-extra/test/clang-tidy/checkers/abseil/Inputs type_traits initializer_list

test

Created using spr 1.3.7
DeltaFile
+427-0clang-tools-extra/test/clang-tidy/checkers/abseil/Inputs/type_traits
+346-0clang-tools-extra/test/clang-tidy/checkers/abseil/Inputs/absl/status/statusor.h
+138-0clang-tools-extra/test/clang-tidy/checkers/abseil/abseil-unchecked-statusor-access.cpp
+69-0clang-tools-extra/test/clang-tidy/checkers/abseil/Inputs/absl/status/status.h
+46-0clang-tools-extra/test/clang-tidy/checkers/abseil/Inputs/absl/meta/type_traits.h
+11-0clang-tools-extra/test/clang-tidy/checkers/abseil/Inputs/initializer_list
+1,037-01 files not shown
+1,047-07 files

LLVM/project cdb525dllvm/lib/Target/NVPTX NVPTXISelLowering.cpp NVPTXSubtarget.h, llvm/test/CodeGen/NVPTX used-bytes-mask.ll param-vectorize-device.ll

[NVPTX] Fix lit test issue from used_bytes_mask (#171220)

Whoops, I made the same mistake as
https://github.com/llvm/llvm-project/pull/169535/files again, so
https://github.com/llvm/llvm-project/commit/5c8c7f3d21ce37bfecd671de4e139a4d933f9be9
is causing build issues.
DeltaFile
+38-0llvm/test/CodeGen/NVPTX/used-bytes-mask.ll
+11-4llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
+3-0llvm/lib/Target/NVPTX/NVPTXSubtarget.h
+0-2llvm/test/CodeGen/NVPTX/param-vectorize-device.ll
+0-1llvm/test/CodeGen/NVPTX/LoadStoreVectorizer.ll
+52-75 files

LLVM/project c05ba63llvm/lib/Target/PowerPC PPCISelLowering.cpp, llvm/test/CodeGen/PowerPC vec_rounding.ll

[PowerPC] Use the same lowering rule for vector rounding instructions (#166307)

They should have the same lowering rule.
DeltaFile
+324-0llvm/test/CodeGen/PowerPC/vec_rounding.ll
+2-2llvm/lib/Target/PowerPC/PPCISelLowering.cpp
+326-22 files

LLVM/project a1f9489flang/lib/Optimizer/Transforms FIRToSCF.cpp, flang/test/Fir/FirToSCF iter-while.fir

[Flang][Fir] Fix the comparison when lowering `fir.iterate_while` to `scf.while`. (#171080)

The comparison depends on the sign of the `step`, and when `step == 0` it always returns `false`.
DeltaFile
+144-48flang/test/Fir/FirToSCF/iter-while.fir
+16-3flang/lib/Optimizer/Transforms/FIRToSCF.cpp
+160-512 files

LLVM/project a215696mlir/include/mlir/Conversion/MathToAPFloat MathToAPFloat.h, mlir/include/mlir/Dialect/Func/Utils Utils.h

[mlir][math] Add FP software implementation lowering pass: math-to-apfloat
DeltaFile
+100-0mlir/lib/Conversion/MathToAPFloat/MathToAPFloat.cpp
+14-53mlir/lib/Conversion/ArithToAPFloat/ArithToAPFloat.cpp
+38-0mlir/lib/Dialect/Func/Utils/Utils.cpp
+21-0mlir/include/mlir/Conversion/MathToAPFloat/MathToAPFloat.h
+17-0mlir/lib/Conversion/MathToAPFloat/CMakeLists.txt
+16-0mlir/include/mlir/Dialect/Func/Utils/Utils.h
+206-534 files not shown
+234-5410 files

LLVM/project 3686ff2bolt/lib/Target/AArch64 AArch64MCPlusBuilder.cpp, llvm/lib/Target/AArch64 AArch64AsmPrinter.cpp AArch64A53Fix835769.cpp

[AArch64] Treat NOP as a separate instruction. (#170968)

Previously, nop was treated as just an alias for hint #0. The
consequence of that was that all the general rules for hint instructions
applied to nop too, in particular that during binary analysis, they were
assumed to have unknown effects. This commit adds AArch64::NOP as a
standalone instruction with no side effects.

The scheduling update in A55-load-store-alias.s is probably not entirely
accurate, but should be more accurate than the previous result.
DeltaFile
+24-24llvm/test/tools/llvm-mca/AArch64/Cortex/A55-load-store-alias.s
+5-5llvm/lib/Target/AArch64/AArch64AsmPrinter.cpp
+3-4llvm/lib/Target/AArch64/AArch64A53Fix835769.cpp
+5-1llvm/lib/Target/AArch64/AArch64InstrInfo.td
+2-4llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
+2-4bolt/lib/Target/AArch64/AArch64MCPlusBuilder.cpp
+41-421 files not shown
+42-437 files

LLVM/project 1e43eb1mlir/include/mlir/Conversion/MathToAPFloat MathToAPFloat.h, mlir/include/mlir/Dialect/Func/Utils Utils.h

[mlir][math] Add FP software implementation lowering pass: math-to-apfloat
DeltaFile
+14-53mlir/lib/Conversion/ArithToAPFloat/ArithToAPFloat.cpp
+52-0mlir/lib/Conversion/MathToAPFloat/MathToAPFloat.cpp
+38-0mlir/lib/Dialect/Func/Utils/Utils.cpp
+21-0mlir/include/mlir/Conversion/MathToAPFloat/MathToAPFloat.h
+17-0mlir/lib/Conversion/MathToAPFloat/CMakeLists.txt
+16-0mlir/include/mlir/Dialect/Func/Utils/Utils.h
+158-533 files not shown
+175-539 files

LLVM/project c595282mlir CMakeLists.txt

[mlir][CMake] enable disabling MLIR_ENABLE_EXECUTION_ENGINE (#171060)

Currently if you pass MLIR_ENABLE_EXECUTION_ENGINE=OFF it's overwritten.
DeltaFile
+5-2mlir/CMakeLists.txt
+5-21 files

LLVM/project 392c302clang/lib/AST/ByteCode InterpBuiltin.cpp

[Clang] Fix unused variable warning from 1911ce132659222aee353882bd55… (#171223)

…70d689745a7d

These are only used in assertions so trigger warnings in release builds.
Fix this per the LLVM programming standards.
DeltaFile
+4-4clang/lib/AST/ByteCode/InterpBuiltin.cpp
+4-41 files

LLVM/project cda8bfamlir/include/mlir/Conversion/MathToAPFloat MathToAPFloat.h, mlir/include/mlir/Dialect/Func/Utils Utils.h

[mlir][math] Add FP software implementation lowering pass: math-to-apfloat
DeltaFile
+14-53mlir/lib/Conversion/ArithToAPFloat/ArithToAPFloat.cpp
+52-0mlir/lib/Conversion/MathToAPFloat/MathToAPFloat.cpp
+38-0mlir/lib/Dialect/Func/Utils/Utils.cpp
+21-0mlir/include/mlir/Conversion/MathToAPFloat/MathToAPFloat.h
+17-0mlir/lib/Conversion/MathToAPFloat/CMakeLists.txt
+16-0mlir/include/mlir/Dialect/Func/Utils/Utils.h
+158-533 files not shown
+175-539 files

LLVM/project 05b7720clang/include/clang/CIR/Dialect/IR CIROps.td, clang/lib/CIR/CodeGen CIRGenBuiltinX86.cpp

[CIR][X86] Implement lowering for sqrt builtins (#169310)

Implements CIR IR generation for X86-specific sqrt builtin functions,
addressing issue #167765.

## Test Results 

Successfully tested the implementation locally. All tests pass:

```bash
$ ./bin/llvm-lit -v ../clang/test/CIR/CodeGen/X86/cir-sqrt-builtins.c

Testing: 1 tests, 1 workers
PASS: Clang :: CIR/CodeGen/X86/cir-sqrt-builtins.c (1 of 1)

Testing Time: 1.18s
Total Discovered Tests: 1
  Passed: 1 (100.00%)
```

    [4 lines not shown]
DeltaFile
+45-0clang/test/CIR/CodeGen/X86/cir-sqrt-builtins.c
+21-0clang/include/clang/CIR/Dialect/IR/CIROps.td
+7-3clang/lib/CIR/CodeGen/CIRGenBuiltinX86.cpp
+8-0clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
+81-34 files

LLVM/project 49f813amlir/include/mlir/Conversion Passes.td, mlir/include/mlir/Conversion/MathToAPFloat MathToAPFloat.h

[mlir][math] Add FP software implementation lowering pass: math-to-apfloat
DeltaFile
+185-0mlir/lib/Conversion/MathToAPFloat/MathToAPFloat.cpp
+21-0mlir/include/mlir/Conversion/MathToAPFloat/MathToAPFloat.h
+17-0mlir/lib/Conversion/MathToAPFloat/CMakeLists.txt
+15-0mlir/include/mlir/Conversion/Passes.td
+1-0mlir/lib/Conversion/CMakeLists.txt
+239-05 files

LLVM/project 786498bllvm/lib/Target/AMDGPU AMDGPUISelLowering.cpp, llvm/test/CodeGen/AMDGPU trunc-store.ll fp_trunc_store_fp32_to_bf16.ll

AMDGPU: Fix truncstore from v6f32 to v6f16 (#171212)

The v6bf16 cases work, but that's likely because v6bf16 isn't
currently an MVT.

Fixes: SWDEV-570985
DeltaFile
+125-0llvm/test/CodeGen/AMDGPU/trunc-store.ll
+48-0llvm/test/CodeGen/AMDGPU/fp_trunc_store_fp32_to_bf16.ll
+14-0llvm/test/CodeGen/AMDGPU/trunc-store-f64-to-f16.ll
+1-0llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
+188-04 files

LLVM/project 0ce6d56lldb/source/Commands CommandObjectBreakpoint.cpp, lldb/test/API/functionalities/breakpoint/breakpoint_by_line_and_column TestBreakpointByLineAndColumn.py

Fix a typo in "breakpoint add file" and add a test (#171206)

lldbutil.run_to_line_breakpoint had usages that set column breakpoints,
so I thought there was coverage of that on the command-line, but
actually all the `run_to` utilities use the SB API's, and there weren't
any tests of setting file line & column breakpoint through
`run_break_set`. So I missed that I had typed the column option `c` -
that's taken by `--command`.

This patch fixes that typo and adds a CLI test for file + line + column.
DeltaFile
+19-0lldb/test/API/functionalities/breakpoint/breakpoint_by_line_and_column/TestBreakpointByLineAndColumn.py
+1-1lldb/source/Commands/CommandObjectBreakpoint.cpp
+20-12 files