LLVM/project 5099c07 compiler-rt/lib/sanitizer_common sanitizer_platform_limits_posix.h sanitizer_linux.cpp

[compiler-rt][sanitizer] some additional Haiku build fixes (#172773)

- Correct field type for cookie variable
- Use proper type for __sanitizer_sigset_t on 64-bit archs
Delta  File
+3 -1  compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_posix.h
+1 -1  compiler-rt/lib/sanitizer_common/sanitizer_linux.cpp
+4 -2  2 files

LLVM/project f24c0ec llvm/test/Feature codegen-plugin.ll

[LLVM] Fix back-end plugin test for AIX
Delta  File
+3 -1  llvm/test/Feature/codegen-plugin.ll
+3 -1  1 file

LLVM/project 745656d mlir/python CMakeLists.txt

try fix windows badcast
Delta  File
+7 -1  mlir/python/CMakeLists.txt
+7 -1  1 file

LLVM/project 8476a60 llvm/lib/Target/AMDGPU AMDGPUCodeGenPrepare.cpp, llvm/test/CodeGen/AMDGPU rsq.f32-safe.ll amdgpu-codegenprepare-fdiv.ll

AMDGPU: Stop requiring afn for f32 rsq formation

We were checking for afn or !fpmath attached to the sqrt. We
are not trying to replace a correctly rounded rsqrt; we are replacing
two correctly rounded operations with the contracted operation.
The net precision is better, so contract on both instructions should
be sufficient. Both the contracted and uncontracted sequences pass
the OpenCL conformance test, and the contracted sequence has a lower
maximum error.
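
As a rough illustration of the rounding argument above, the following Python sketch emulates f32 arithmetic with the standard `struct` module. It compares the two-step sequence (round after the sqrt, round again after the reciprocal) against a reference evaluated in binary64 and rounded to f32 once. This is only a sketch: it does not model the AMDGPU rsq instruction or its real error bound, and all helper names are hypothetical.

```python
import math
import struct


def to_f32(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def rsq_two_step(x: float) -> float:
    """1/sqrt(x) as two separately rounded f32 operations."""
    x = to_f32(x)
    root = to_f32(math.sqrt(x))   # first rounding: the sqrt result
    return to_f32(1.0 / root)     # second rounding: the reciprocal


def rsq_reference(x: float) -> float:
    """1/sqrt(x) evaluated in binary64, then rounded to f32 once."""
    x = to_f32(x)
    return to_f32(1.0 / math.sqrt(x))


def ulp_distance(a: float, b: float) -> int:
    """Number of representable f32 steps between two positive f32 values."""
    ia = struct.unpack("<I", struct.pack("<f", a))[0]
    ib = struct.unpack("<I", struct.pack("<f", b))[0]
    return abs(ia - ib)


if __name__ == "__main__":
    # Report the largest gap seen between the two-step result and the reference.
    worst = max(
        ulp_distance(rsq_two_step(i * 0.37), rsq_reference(i * 0.37))
        for i in range(1, 2000)
    )
    print("largest distance in f32 steps:", worst)
```

Any nonzero distance reported here comes from the second rounding in the two-step sequence, which is the error source a single contracted operation avoids.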
Delta         File
+504 -1,529   llvm/test/CodeGen/AMDGPU/rsq.f32-safe.ll
+52 -45       llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.ll
+6 -25        llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+562 -1,599   3 files

LLVM/project ee387f3 llvm/lib/Target/AMDGPU AMDGPUCodeGenPrepare.cpp, llvm/test/CodeGen/AMDGPU rsq.f64.ll amdgpu-codegenprepare-fdiv.f64.ll

X not Y0
Delta      File
+144 -144  llvm/test/CodeGen/AMDGPU/rsq.f64.ll
+9 -9      llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.f64.ll
+1 -1      llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+154 -154  3 files

LLVM/project 5efbb71 llvm/lib/Target/AMDGPU AMDGPUCodeGenPrepare.cpp

swap
Delta  File
+3 -3  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+3 -3  1 file

LLVM/project c9231c5 llvm/lib/Target/AMDGPU AMDGPUCodeGenPrepare.cpp

nnan not useful
Delta  File
+1 -3  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+1 -3  1 file

LLVM/project c8714d2 llvm/test/CodeGen/AMDGPU rsq.f64.ll amdgpu-codegenprepare-fdiv.f64.ll

nnan not useful
Delta    File
+8 -12   llvm/test/CodeGen/AMDGPU/rsq.f64.ll
+6 -4    llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.f64.ll
+14 -16  2 files

LLVM/project 67bb3a6 llvm/lib/Target/AMDGPU AMDGPUCodeGenPrepare.cpp

Interested mask bug
Delta  File
+1 -1  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+1 -1  1 file

LLVM/project 1d74991 llvm/lib/Target/AMDGPU AMDGPUCodeGenPrepare.cpp, llvm/test/CodeGen/AMDGPU rsq.f64.ll amdgpu-codegenprepare-fdiv.f64.ll

AMDGPU: Introduce f64 rsq pattern in AMDGPUCodeGenPrepare

Handle this here instead of DAGCombine, mostly because the f32
case is handled here due to the dependency on !fpmath. Also we can
take advantage of computeKnownFPClass.
Delta          File
+4,536 -3,195  llvm/test/CodeGen/AMDGPU/rsq.f64.ll
+309 -56       llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.f64.ll
+129 -12       llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+4,974 -3,263  3 files

LLVM/project 33fb5b5 mlir/lib/Target/LLVMIR/Dialect/OpenMP OpenMPToLLVMIRTranslation.cpp

fix comment
Delta  File
+2 -1  mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
+2 -1  1 file

LLVM/project ad66b67 llvm/include/llvm/IR RuntimeLibcalls.td, llvm/test/CodeGen/AArch64 chkstk.ll

[RuntimeLibcalls] Fix stack probes for aarch64 mingw (#172973)

Before bb993a89a873ee832d063fb566c889ca3d0c2b72, the stack probes on
aarch64 windows unconditionally called "__chkstk"; after this
refactoring, it only got "__chkstk" for MSVC environments, and no stack
probe at all for other environments.

Simplify RuntimeLibcalls.td to do the same for aarch64 as for arm,
simply adding "__chkstk" for all Windows variants, and add a test to
cover this case.
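
The before/after behaviour described above can be summarised in a short Python sketch. It only models the selection logic as the commit message states it for aarch64; the function names and the OS/environment strings are hypothetical, and the real logic lives in RuntimeLibcalls.td.

```python
from typing import Optional

CHKSTK = "__chkstk"


def probe_symbol_regressed(os_name: str, environment: str) -> Optional[str]:
    """Behaviour after the earlier refactoring, per the commit message:
    only MSVC environments got a stack probe."""
    if os_name == "windows" and environment == "msvc":
        return CHKSTK
    return None


def probe_symbol_fixed(os_name: str, environment: str) -> Optional[str]:
    """Behaviour after this fix: every Windows variant gets the probe."""
    return CHKSTK if os_name == "windows" else None
```

For example, `probe_symbol_regressed("windows", "gnu")` returns `None` (the mingw case that lost its probe), while `probe_symbol_fixed("windows", "gnu")` returns `"__chkstk"`.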
Delta  File
+1 -4  llvm/include/llvm/IR/RuntimeLibcalls.td
+2 -0  llvm/test/CodeGen/AArch64/chkstk.ll
+3 -4  2 files

LLVM/project e8391d4 llvm/lib/Target/AArch64 AArch64InstrFormats.td, llvm/test/CodeGen/AArch64 rcpc3-sve.ll

[AArch64] Set default schedule of load-acquire RCpc instructions. (#172881)

This patch sets the default schedule of RCpc load-acquires to WriteLD,
as is already done for rcpc-immo load-acquires.
Delta  File
+1 -1  llvm/lib/Target/AArch64/AArch64InstrFormats.td
+1 -1  llvm/test/CodeGen/AArch64/rcpc3-sve.ll
+2 -2  2 files

LLVM/project 24916b5 mlir/include/mlir/Dialect/OpenMP OpenMPClauses.td, mlir/lib/Target/LLVMIR/Dialect/OpenMP OpenMPToLLVMIRTranslation.cpp

comments fixes
Delta   File
+9 -5   mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
+1 -1   mlir/include/mlir/Dialect/OpenMP/OpenMPClauses.td
+10 -6  2 files

LLVM/project ecb744f mlir/python CMakeLists.txt

try fix windows badcast
Delta  File
+6 -1  mlir/python/CMakeLists.txt
+6 -1  1 file

LLVM/project 0f4c88d flang/lib/Lower/OpenMP OpenMP.cpp, flang/lib/Optimizer/OpenMP LowerWorkdistribute.cpp

Remove separate thread_limit argument from clause
Delta    File
+29 -45  mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
+19 -19  mlir/test/Dialect/OpenMP/invalid.mlir
+9 -7    flang/lib/Lower/OpenMP/OpenMP.cpp
+8 -8    mlir/include/mlir/Dialect/OpenMP/OpenMPClauses.td
+6 -5    mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
+4 -4    flang/lib/Optimizer/OpenMP/LowerWorkdistribute.cpp
+75 -88  2 files not shown
+78 -90  8 files

LLVM/project f7f5a4c mlir/include/mlir/Dialect/OpenMP OpenMPClauses.td

update thread_limit description
Delta   File
+12 -1  mlir/include/mlir/Dialect/OpenMP/OpenMPClauses.td
+12 -1  1 file

LLVM/project 293b906 flang/lib/Optimizer/OpenMP LowerWorkdistribute.cpp, mlir/include/mlir/Dialect/OpenMP OpenMPClauses.td

[OpenMP][MLIR] Add thread_limit with dims modifier support
Delta     File
+141 -8   mlir/test/Dialect/OpenMP/invalid.mlir
+70 -2    mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
+28 -1    mlir/include/mlir/Dialect/OpenMP/OpenMPClauses.td
+10 -6    flang/lib/Optimizer/OpenMP/LowerWorkdistribute.cpp
+11 -5    mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
+7 -1     mlir/test/Dialect/OpenMP/ops.mlir
+267 -23  6 files

LLVM/project 64d4d10 llvm/test/Transforms/InstCombine freeze.ll

[InstCombine] Add more freeze PHI tests (NFC) (#172842)

Adds a number of test cases where we have to be careful when pushing
freeze around. The first two tests are taken from #157678, which failed
to land due to compile-time issues. The last two tests were reduced from
llvm-opt-benchmark workloads on #171435, which was an attempt at
addressing the underlying cause of the hangs.
Delta    File
+241 -3  llvm/test/Transforms/InstCombine/freeze.ll
+241 -3  1 file

LLVM/project 0db4393 llvm/test/CodeGen/AMDGPU amdgpu-codegenprepare-fdiv.f64.ll

AMDGPU: Add baseline tests for f64 rsq pattern handling (#172052)

Delta    File
+526 -0  llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.f64.ll
+526 -0  1 file

LLVM/project 8c93254 llvm/test/Transforms/LoopStrengthReduce/X86 icmp-zero-offset-overflow.ll

[LSR] Add test for implicit truncation on icmp immediate (NFC)

Test case for:
https://github.com/llvm/llvm-project/pull/171456#issuecomment-3670755137
Delta   File
+48 -0  llvm/test/Transforms/LoopStrengthReduce/X86/icmp-zero-offset-overflow.ll
+48 -0  1 file

LLVM/project dbd1a04 mlir/python CMakeLists.txt

try fix windows badcast
Delta  File
+5 -1  mlir/python/CMakeLists.txt
+5 -1  1 file

LLVM/project 0182c85 llvm/bindings/ocaml/llvm llvm.mli llvm_ocaml.c, llvm/test/Bindings/OCaml core.ml

[OCaml] Remove deprecated const_nuw_neg and build_nuw_neg APIs (#171466)

I think we usually remove deprecated C API functions from the OCaml
bindings right away. These two at least should be essentially useless.
Delta   File
+0 -12  llvm/bindings/ocaml/llvm/llvm.mli
+0 -12  llvm/bindings/ocaml/llvm/llvm_ocaml.c
+0 -4   llvm/test/Bindings/OCaml/core.ml
+0 -3   llvm/bindings/ocaml/llvm/llvm.ml
+0 -31  4 files

LLVM/project 6d5274c mlir/python CMakeLists.txt

try fix windows badcast
Delta  File
+3 -1  mlir/python/CMakeLists.txt
+3 -1  1 file

LLVM/project a22d1b5 llvm/include/llvm/IR Constants.h, llvm/lib/Transforms/Scalar StraightLineStrengthReduce.cpp

[ConstantInt] Add ImplicitTrunc parameter to getSigned() (NFC) (#172875)

For consistency with `ConstantInt::get()`, add an ImplicitTrunc
parameter to `ConstantInt::getSigned()` as well. It currently defaults
to true and will be flipped to false in the future (by #171456).
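
To make the parameter's effect concrete, here is a small Python model of the behaviour described. It assumes that passing ImplicitTrunc as false rejects a value that does not fit the signed range of the target width, modelled here as a `ValueError`; how LLVM itself reports that case is not stated in the commit message. `fits_signed` and `get_signed` are hypothetical stand-ins, not the LLVM API.

```python
def fits_signed(value: int, bits: int) -> bool:
    """True if `value` is representable as a signed `bits`-wide integer."""
    return -(1 << (bits - 1)) <= value < (1 << (bits - 1))


def get_signed(value: int, bits: int, implicit_trunc: bool = True) -> int:
    """Model of building a signed `bits`-wide constant from `value`.

    With implicit_trunc=True, out-of-range values are silently wrapped.
    With implicit_trunc=False, they are rejected (assumed behaviour).
    """
    if not fits_signed(value, bits) and not implicit_trunc:
        raise ValueError(f"{value} does not fit in a signed {bits}-bit integer")
    low = value & ((1 << bits) - 1)   # keep only the low `bits` bits
    if low >> (bits - 1):             # sign bit set: reinterpret as negative
        return low - (1 << bits)
    return low
```

With the current default, `get_signed(128, 8)` silently wraps to `-128`; with `implicit_trunc=False` the same call raises, which is the kind of mistake the stricter default is meant to expose.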
Delta  File
+7 -4  llvm/include/llvm/IR/Constants.h
+2 -2  llvm/lib/Transforms/Scalar/StraightLineStrengthReduce.cpp
+9 -6  2 files

LLVM/project 8994d39 clang/test/Modules pr149404.cppm

[NFC] [C++20] [Modules] Add a test for issue 149404

Close https://github.com/llvm/llvm-project/issues/149404 and add its
reproducer
Delta   File
+70 -0  clang/test/Modules/pr149404.cppm
+70 -0  1 file

LLVM/project 04e164e llvm/lib/Target/LoongArch LoongArchISelLowering.cpp, llvm/test/CodeGen/LoongArch issue172964.ll

[LoongArch] Fix assertion when combining SETCC with wide constants
Delta   File
+21 -0  llvm/test/CodeGen/LoongArch/issue172964.ll
+2 -2   llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
+23 -2  2 files

LLVM/project ecf979c mlir/docs/Tutorials/Toy Ch-6.md

[mlir] [docs] fix toy readme (#172262)

The MLIR Toy docs use a deprecated API; this change fixes them.
Delta  File
+2 -2  mlir/docs/Tutorials/Toy/Ch-6.md
+2 -2  1 file

LLVM/project 6a5dd8b cmake/Modules CMakePolicy.cmake, mlir/python CMakeLists.txt

try fix windows badcast
Delta  File
+5 -0  cmake/Modules/CMakePolicy.cmake
+2 -0  mlir/python/CMakeLists.txt
+7 -0  2 files

LLVM/project cdaa08e bolt/test dump-dot-func.test lsda-section-name.cpp, bolt/test/AArch64 compact-code-model.s

[BOLT] Add missing system-linux requirements to tests

Many lit tests fail on macOS because they are Linux-specific
but do not use the "REQUIRES: system-linux" setting.

This patch adds the missing requirements to the tests that fail on macOS.
Delta   File
+4 -2   bolt/test/dump-dot-func.test
+3 -1   bolt/test/lsda-section-name.cpp
+3 -0   bolt/test/timers.c
+3 -0   bolt/test/yaml-profile-kind.c
+2 -0   bolt/test/AArch64/compact-code-model.s
+2 -0   bolt/test/dwarf5-missing-dwo.c
+17 -3  2 files not shown
+21 -3  8 files