LLVM/project e700f15 clang/include/clang/Basic LangOptions.def, clang/test/ClangScanDeps strip-codegen-args.m

[clang] Unify -mspeculative-load-hardening as a benign compiler option (#166640)

Before this patch, compilations using modules and
`-mspeculative-load-hardening` failed because the compiler was inconsistent
about whether this option affects module compatibility.

This repairs it by always treating it as benign. This was determined by
checking if the option enables any kind of preprocessor checks and
comparing how the compiler handles a similar option that impacts codegen
(-fvisibility=hidden) but is more widely used.

resolves: rdar://163985667
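
The idea of a "benign" option can be illustrated with a minimal sketch. This is not clang's implementation: the `Compat` enum, `OptionSet` type, `canReuseModule` function, and the `hypothetical-abi-option` name are all assumptions made for illustration. It only shows that options classified as benign are skipped when deciding whether a prebuilt module matches the current invocation.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical classification, not clang's actual data structures.
enum class Compat { Affecting, Benign };

struct OptionState {
  Compat kind;
  bool enabled;
};
using OptionSet = std::map<std::string, OptionState>;

// True if some non-benign option in `a` has a different on/off state in `b`
// (an option missing from `b` counts as disabled).
static bool hasAffectingMismatch(const OptionSet &a, const OptionSet &b) {
  for (const auto &entry : a) {
    if (entry.second.kind == Compat::Benign)
      continue; // benign options never block module reuse
    auto it = b.find(entry.first);
    bool enabledInB = it != b.end() && it->second.enabled;
    if (enabledInB != entry.second.enabled)
      return true;
  }
  return false;
}

// A prebuilt module is reusable when both option sets agree on every
// non-benign option, checked in both directions.
bool canReuseModule(const OptionSet &built, const OptionSet &requested) {
  return !hasAffectingMismatch(built, requested) &&
         !hasAffectingMismatch(requested, built);
}

// Usage: a difference in a benign option alone keeps the module reusable.
void demoBenignOption() {
  OptionSet built = {{"speculative-load-hardening", {Compat::Benign, false}},
                     {"hypothetical-abi-option", {Compat::Affecting, true}}};
  OptionSet requested = built;
  requested["speculative-load-hardening"].enabled = true;
  assert(canReuseModule(built, requested));
  requested["hypothetical-abi-option"].enabled = false;
  assert(!canReuseModule(built, requested));
}
```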
Delta File
+6 -0 clang/test/ClangScanDeps/strip-codegen-args.m
+1 -1 clang/include/clang/Basic/LangOptions.def
+7 -1, 2 files

LLVM/project 95557e3 flang/lib/Optimizer/Builder CUDAIntrinsicCall.cpp, flang/test/Lower/CUDA cuda-device-proc.cuf

[flang][cuda][NFC] Use the NVVM op for syncwarp (#166695)

Delta File
+1 -7 flang/lib/Optimizer/Builder/CUDAIntrinsicCall.cpp
+2 -2 flang/test/Lower/CUDA/cuda-device-proc.cuf
+3 -9, 2 files

LLVM/project 9a0000b llvm/lib/Target/AMDGPU FLATInstructions.td, llvm/test/MC/AMDGPU gfx9_asm_flat.s gfx90a_ldst_acc.s

Revert "[AMDGPU][MC] GFX9 - Support NV bit in FLAT instructions in pre-GFX90A" (#166693)

Reverts llvm/llvm-project#154237

It breaks a buildbot: https://lab.llvm.org/buildbot/#/builders/123/builds/30172
Delta File
+0 -864 llvm/test/MC/Disassembler/AMDGPU/gfx9_flat.txt
+0 -858 llvm/test/MC/AMDGPU/gfx9_asm_flat.s
+47 -89 llvm/lib/Target/AMDGPU/FLATInstructions.td
+26 -26 llvm/test/MC/AMDGPU/gfx90a_ldst_acc.s
+0 -43 llvm/test/MC/AMDGPU/gfx90a_err.s
+0 -28 llvm/test/MC/AMDGPU/gfx942_err.s
+73 -1,908, 3 files not shown
+76 -1,939, 9 files

LLVM/project 3361e40 libcxx/utils/ci buildkite-pipeline.yml

[AIX] Enable libc++ bots on AIX (#166650)

Remove the skip after confirming that builds pass locally. The workers were
upgraded to clang 20.

Fixes #162516
Delta File
+0 -2 libcxx/utils/ci/buildkite-pipeline.yml
+0 -2, 1 file

LLVM/project a3f0889 llvm/lib/IR Constants.cpp, llvm/test/Instrumentation/HWAddressSanitizer basic.ll

[WIP][IR][Constants] Use `ptr addrspace(N) zeroinitializer` to represent a zero-value pointer
Delta File
+41 -41 llvm/test/Instrumentation/HWAddressSanitizer/X86/basic.ll
+26 -26 llvm/test/Instrumentation/HWAddressSanitizer/RISCV/basic.ll
+26 -26 llvm/test/Instrumentation/HWAddressSanitizer/basic.ll
+16 -16 llvm/test/Transforms/LoopStrengthReduce/funclet.ll
+28 -3 llvm/lib/IR/Constants.cpp
+12 -12 llvm/test/Instrumentation/MemorySanitizer/msan_basic.ll
+149 -124, 33 files not shown
+307 -203, 39 files

LLVM/project 9042c43 llvm/lib/Target/AMDGPU FLATInstructions.td, llvm/test/MC/AMDGPU gfx9_asm_flat.s gfx90a_ldst_acc.s

Revert "[AMDGPU][MC] GFX9 - Support NV bit in FLAT instructions in pre-GFX90A…"

This reverts commit db6231b4c2e18bb5fc107624e9c9071b02124844.
Delta File
+0 -864 llvm/test/MC/Disassembler/AMDGPU/gfx9_flat.txt
+0 -858 llvm/test/MC/AMDGPU/gfx9_asm_flat.s
+47 -89 llvm/lib/Target/AMDGPU/FLATInstructions.td
+26 -26 llvm/test/MC/AMDGPU/gfx90a_ldst_acc.s
+0 -43 llvm/test/MC/AMDGPU/gfx90a_err.s
+0 -28 llvm/test/MC/AMDGPU/gfx942_err.s
+73 -1,908, 3 files not shown
+76 -1,939, 9 files

LLVM/project 27a981c llvm/lib/IR Constants.cpp, llvm/test/Instrumentation/HWAddressSanitizer basic.ll

[WIP][IR][Constants] Use `ptr addrspace(N) zeroinitializer` to represent a zero-value pointer
Delta File
+41 -41 llvm/test/Instrumentation/HWAddressSanitizer/X86/basic.ll
+26 -26 llvm/test/Instrumentation/HWAddressSanitizer/RISCV/basic.ll
+26 -26 llvm/test/Instrumentation/HWAddressSanitizer/basic.ll
+16 -16 llvm/test/Transforms/LoopStrengthReduce/funclet.ll
+28 -3 llvm/lib/IR/Constants.cpp
+12 -12 llvm/test/Instrumentation/MemorySanitizer/msan_basic.ll
+149 -124, 34 files not shown
+302 -204, 40 files

LLVM/project 51d3fca mlir/include/mlir/Conversion/LLVMCommon VectorPattern.h, mlir/lib/Conversion/ArithToLLVM ArithToLLVM.cpp

[mlir][arith] Fix `arith.select` lowering after #166513
Delta File
+8 -5 mlir/include/mlir/Conversion/LLVMCommon/VectorPattern.h
+4 -2 mlir/test/Conversion/ArithToLLVM/arith-to-llvm.mlir
+3 -1 mlir/lib/Conversion/ArithToLLVM/ArithToLLVM.cpp
+15 -8, 3 files

LLVM/project 3680b11 mlir/include/mlir/Dialect/LLVMIR FunctionCallUtils.h, mlir/lib/Dialect/LLVMIR/IR FunctionCallUtils.cpp

add X-macros
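
For readers unfamiliar with the technique, a generic X-macro sketch follows. It does not reproduce this commit: the `HYPOTHETICAL_BINARY_OPS` list, the `BinaryOp` enum, and `toString` are invented for illustration. The point is that one list macro is expanded several times with different definitions of `X`, so the enum, the name table, and the entry count cannot drift apart.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical single source of truth: X(enumerator, printable name).
#define HYPOTHETICAL_BINARY_OPS(X) \
  X(Add, "add")                    \
  X(Sub, "sub")                    \
  X(Mul, "mul")

// Expansion 1: one enumerator per list entry.
enum class BinaryOp {
#define X(id, str) id,
  HYPOTHETICAL_BINARY_OPS(X)
#undef X
};

// Expansion 2: one switch case per list entry.
const char *toString(BinaryOp op) {
  switch (op) {
#define X(id, str) \
  case BinaryOp::id: \
    return str;
    HYPOTHETICAL_BINARY_OPS(X)
#undef X
  }
  return "unknown";
}

// Expansion 3: count the entries by summing one per entry.
#define X(id, str) +1
constexpr int kNumBinaryOps = 0 HYPOTHETICAL_BINARY_OPS(X);
#undef X

// Usage: adding a new X(...) line updates all three expansions at once.
void demoXMacro() {
  assert(std::strcmp(toString(BinaryOp::Sub), "sub") == 0);
  assert(kNumBinaryOps == 3);
}
```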
Delta File
+65 -10 mlir/lib/ExecutionEngine/APFloatWrappers.cpp
+22 -11 mlir/lib/Dialect/LLVMIR/IR/FunctionCallUtils.cpp
+17 -3 mlir/include/mlir/Dialect/LLVMIR/FunctionCallUtils.h
+104 -24, 3 files

LLVM/project 48876ec mlir/include/mlir/Conversion/LLVMCommon VectorPattern.h, mlir/lib/Conversion/ArithToLLVM ArithToLLVM.cpp

[mlir][arith] Fix `arith.select` lowering after #166513
Delta File
+8 -5 mlir/include/mlir/Conversion/LLVMCommon/VectorPattern.h
+4 -2 mlir/test/Conversion/ArithToLLVM/arith-to-llvm.mlir
+4 -1 mlir/lib/Conversion/ArithToLLVM/ArithToLLVM.cpp
+16 -8, 3 files

LLVM/project bede88d clang/www cxx_dr_status.html, llvm/test/CodeGen/AMDGPU/GlobalISel srem.i64.ll

Merge branch 'main' into users/zhaoqi5/enable-sink-fold
Delta File
+5,100 -0 llvm/test/CodeGen/RISCV/GlobalISel/rvv/vloxei.ll
+5,100 -0 llvm/test/CodeGen/RISCV/GlobalISel/rvv/vluxei.ll
+4,881 -0 llvm/test/CodeGen/RISCV/GlobalISel/rvv/vsoxei.ll
+4,881 -0 llvm/test/CodeGen/RISCV/GlobalISel/rvv/vsuxei.ll
+1,561 -1,561 llvm/test/CodeGen/AMDGPU/GlobalISel/srem.i64.ll
+3,067 -10 clang/www/cxx_dr_status.html
+24,590 -1,571, 1,852 files not shown
+86,990 -21,964, 1,858 files

LLVM/project 66813cc llvm/test/Instrumentation/HWAddressSanitizer basic.ll, llvm/test/Instrumentation/HWAddressSanitizer/RISCV basic.ll

[WIP][IR][Constants] Use `ptr addrspace(N) zeroinitializer` to represent a zero-value pointer
Delta File
+41 -41 llvm/test/Instrumentation/HWAddressSanitizer/X86/basic.ll
+26 -26 llvm/test/Instrumentation/HWAddressSanitizer/basic.ll
+26 -26 llvm/test/Instrumentation/HWAddressSanitizer/RISCV/basic.ll
+32 -16 llvm/test/Transforms/InstSimplify/and-or-icmp-zero.ll
+20 -20 llvm/test/Transforms/LoopStrengthReduce/funclet.ll
+8 -23 llvm/test/Transforms/InstCombine/ptrtoint-nullgep.ll
+153 -152, 44 files not shown
+361 -280, 50 files

LLVM/project 260f9e9 llvm/utils profcheck-xfail.txt

[profcheck] Update profcheck-xfail.txt after recent fixes (#166685)

Delta File
+1 -102 llvm/utils/profcheck-xfail.txt
+1 -102, 1 file

LLVM/project 67b6fd0 llvm/lib/Target/AMDGPU SIFoldOperands.cpp, llvm/test/CodeGen/AMDGPU llvm.is.fpclass.f16.ll true16-fold.mir

AMDGPU: Delete redundant recursive copy handling code (#157032)

This fixes a regression exposed after
445415219708f9539801018e03282049ca33e0e2.
This introduces a few small regressions for true16. There are more cases
where the value can propagate through subregister extracts which need
new handling. They're also small enough that perhaps there's a way to
avoid needing to deal with this case in the first place.
Delta File
+346 -210 llvm/test/CodeGen/AMDGPU/llvm.is.fpclass.f16.ll
+34 -1 llvm/test/CodeGen/AMDGPU/true16-fold.mir
+0 -29 llvm/lib/Target/AMDGPU/SIFoldOperands.cpp
+29 -0 llvm/test/CodeGen/AMDGPU/true16-imm-folded-to-0-regression.ll
+409 -240, 4 files

LLVM/project 08d3f97 llvm/test/MC/AMDGPU gfx9_asm_flat.s, llvm/test/MC/Disassembler/AMDGPU gfx9_flat.txt

rebase

Created using spr 1.3.7
Delta File
+864 -0 llvm/test/MC/Disassembler/AMDGPU/gfx9_flat.txt
+858 -0 llvm/test/MC/AMDGPU/gfx9_asm_flat.s
+694 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-and-casts.ll
+659 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-chained.ll
+1 -537 llvm/utils/profcheck-xfail.txt
+25 -250 llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp
+3,101 -787, 252 files not shown
+6,701 -1,704, 258 files

LLVM/project 1b232e5 llvm/test/MC/AMDGPU gfx9_asm_flat.s, llvm/test/MC/Disassembler/AMDGPU gfx9_flat.txt

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
Delta File
+864 -0 llvm/test/MC/Disassembler/AMDGPU/gfx9_flat.txt
+858 -0 llvm/test/MC/AMDGPU/gfx9_asm_flat.s
+694 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-and-casts.ll
+659 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-chained.ll
+1 -537 llvm/utils/profcheck-xfail.txt
+25 -250 llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp
+3,101 -787, 252 files not shown
+6,701 -1,704, 258 files

LLVM/project 4ae6066 llvm/test/MC/AMDGPU gfx9_asm_flat.s, llvm/test/MC/Disassembler/AMDGPU gfx9_flat.txt

rebase

Created using spr 1.3.7
Delta File
+864 -0 llvm/test/MC/Disassembler/AMDGPU/gfx9_flat.txt
+858 -0 llvm/test/MC/AMDGPU/gfx9_asm_flat.s
+694 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-and-casts.ll
+659 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-chained.ll
+1 -537 llvm/utils/profcheck-xfail.txt
+25 -250 llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp
+3,101 -787, 252 files not shown
+6,701 -1,704, 258 files

LLVM/project c4d1cc4 llvm/test/MC/AMDGPU gfx9_asm_flat.s, llvm/test/MC/Disassembler/AMDGPU gfx9_flat.txt

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
Delta File
+864 -0 llvm/test/MC/Disassembler/AMDGPU/gfx9_flat.txt
+858 -0 llvm/test/MC/AMDGPU/gfx9_asm_flat.s
+694 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-and-casts.ll
+659 -0 llvm/test/Transforms/LoopVectorize/AArch64/transform-narrow-interleave-to-widen-memory-with-wide-ops-chained.ll
+1 -537 llvm/utils/profcheck-xfail.txt
+25 -250 llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp
+3,101 -787, 252 files not shown
+6,701 -1,704, 258 files

LLVM/project 5321f5c compiler-rt/test/profile instrprof-tmpdir.c

[compiler-rt][Profile] Temporarily Disable Test on AIX

This was very hackily patched in
16ef893e9fdec2b08dafc82f5450b41834e09039 to not use env -u. The internal
shell does not support unset, but does support env -u. Disable the test
for now so we can enable the internal shell with a TODO to enable it
after the internal shell landing has stuck.

Reviewers: fmayer, vitalybuka, w2yehia, daltenty, mingmingl-llvm, madanial0

Reviewed By: mingmingl-llvm

Pull Request: https://github.com/llvm/llvm-project/pull/166637
Delta File
+6 -2 compiler-rt/test/profile/instrprof-tmpdir.c
+6 -2, 1 file

LLVM/project c3eacf9 llvm/utils profcheck-xfail.txt

[profcheck] Update profcheck-xfail.txt after recent fixes
Delta File
+1 -102 llvm/utils/profcheck-xfail.txt
+1 -102, 1 file

LLVM/project 9c42dcd mlir/include/mlir/Conversion/LLVMCommon VectorPattern.h, mlir/lib/Conversion/ArithToLLVM ArithToLLVM.cpp

[mlir][arith] Fix `arith.cmpf` lowering with unsupported FP types
Delta File
+14 -27 mlir/include/mlir/Conversion/LLVMCommon/VectorPattern.h
+21 -0 mlir/lib/Conversion/LLVMCommon/VectorPattern.cpp
+8 -4 mlir/test/Conversion/ArithToLLVM/arith-to-llvm.mlir
+4 -0 mlir/lib/Conversion/ArithToLLVM/ArithToLLVM.cpp
+47 -31, 4 files

LLVM/project 77e3975 llvm/lib/Transforms/Scalar LoopSimplifyCFG.cpp, llvm/test/Transforms/LoopSimplifyCFG constant-fold-branch.ll

[LSCFG][profcheck] Add dummy branch weights for the dummy switch to dead exits (#164714)

As noted in the doc comment of `handleDeadExits`, the dummy switch is just an artifact of the constraints placed by the fact that we operate in a loop pass. Adding weights here is unnecessary, but the complexity is low, and it helps keep things easy for profile propagation verification (in a sense, the overall complexity would be higher if we special-cased this somehow).

Issue #147390
Delta File
+54 -50 llvm/test/Transforms/LoopSimplifyCFG/constant-fold-branch.ll
+12 -0 llvm/lib/Transforms/Scalar/LoopSimplifyCFG.cpp
+66 -50, 2 files

LLVM/project b84784f llvm/test lit.cfg.py, llvm/utils profcheck-xfail.txt

[profcheck] Exclude GPU tests (#166681)

Delta File
+0 -92 llvm/utils/profcheck-xfail.txt
+2 -0 llvm/test/lit.cfg.py
+2 -92, 2 files

LLVM/project d6713e7 bolt/include/bolt/Profile YAMLProfileReader.h YAMLProfileWriter.h, bolt/lib/Profile YAMLProfileReader.cpp StaleProfileMatching.cpp

[𝘀𝗽𝗿] changes to main this commit is based on

Created using spr 1.3.4

[skip ci]
Delta File
+385 -86 bolt/lib/Profile/YAMLProfileReader.cpp
+194 -232 bolt/lib/Profile/StaleProfileMatching.cpp
+68 -55 bolt/lib/Profile/YAMLProfileWriter.cpp
+69 -46 bolt/include/bolt/Profile/YAMLProfileReader.h
+19 -21 bolt/include/bolt/Profile/YAMLProfileWriter.h
+12 -14 bolt/include/bolt/Profile/ProfileYAMLMapping.h
+747 -454, 8 files not shown
+782 -503, 14 files

LLVM/project 5077d56 bolt/include/bolt/Profile YAMLProfileReader.h YAMLProfileWriter.h, bolt/lib/Profile YAMLProfileReader.cpp StaleProfileMatching.cpp

[𝘀𝗽𝗿] initial version

Created using spr 1.3.4
Delta File
+391 -86 bolt/lib/Profile/YAMLProfileReader.cpp
+194 -232 bolt/lib/Profile/StaleProfileMatching.cpp
+144 -64 bolt/lib/Profile/YAMLProfileWriter.cpp
+69 -46 bolt/include/bolt/Profile/YAMLProfileReader.h
+20 -23 bolt/include/bolt/Profile/YAMLProfileWriter.h
+16 -14 bolt/include/bolt/Profile/ProfileYAMLMapping.h
+834 -465, 10 files not shown
+873 -518, 16 files

LLVM/project c6036c7 llvm/test lit.cfg.py, llvm/utils profcheck-xfail.txt

[profcheck] Exclude GPU tests
Delta File
+0 -92 llvm/utils/profcheck-xfail.txt
+2 -0 llvm/test/lit.cfg.py
+2 -92, 2 files

LLVM/project 93ef576 llvm/utils profcheck-xfail.txt

Add binop-select test to profcheck-xfail.txt (#166678)

Revealed in PR #166102, which does not itself _cause_ the profile to be
dropped. Referenced here in case it makes debugging easier.
Delta File
+1 -0 llvm/utils/profcheck-xfail.txt
+1 -0, 1 file

LLVM/project 47e4501 flang/lib/Lower OpenACC.cpp, flang/test/Lower/OpenACC acc-kernels-loop.f90 acc-kernels.f90

[flang][acc] Lower zero modifier for Copyout clause (#166660)

From the OpenACC 3.4 Specification:
```
2.7.9 copyout clause
The copyout clause may appear on structured data and compute constructs, on declare
directives, and on exit data directives. The clause may optionally have a zero modifier if the
copyout clause appears on a structured data or compute construct.
Only the following modifiers may appear in the optional modifier-list: always, alwaysin or zero.
Additionally, on structured data and compute constructs capture modifier may appear
```
`readonly` is not a legal modifier for the `copyout` clause. The call to
`genDataOperandOperationsWithModifier` should be checking the parsed
modifier for the `copyout` clause against the `Zero` modifier.
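
The rule in the quoted excerpt can be written as a small predicate. This is a simplified reading of that excerpt only, not flang's lowering code: the `Modifier` and `Construct` enums and `isLegalCopyoutModifier` are hypothetical names. It shows why the check for `copyout` should test for `Zero` and never for `readonly`.

```cpp
#include <cassert>

// Hypothetical enums; flang's parse tree uses its own types.
enum class Modifier { Always, AlwaysIn, Zero, Capture, ReadOnly };
enum class Construct { StructuredData, Compute, Declare, ExitData };

// Returns true if `modifier` may appear on a copyout clause attached to
// `construct`, following the excerpt quoted in the commit message.
bool isLegalCopyoutModifier(Modifier modifier, Construct construct) {
  const bool structuredOrCompute = construct == Construct::StructuredData ||
                                   construct == Construct::Compute;
  switch (modifier) {
  case Modifier::Always:
  case Modifier::AlwaysIn:
    return true;
  case Modifier::Zero:
  case Modifier::Capture:
    // Both are tied to structured data and compute constructs.
    return structuredOrCompute;
  case Modifier::ReadOnly:
    // Not in the copyout modifier-list at all.
    return false;
  }
  return false;
}

// Usage: zero is accepted on a compute construct, readonly never is.
void demoCopyoutModifiers() {
  assert(isLegalCopyoutModifier(Modifier::Zero, Construct::Compute));
  assert(!isLegalCopyoutModifier(Modifier::ReadOnly, Construct::Compute));
}
```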
Delta File
+2 -2 flang/lib/Lower/OpenACC.cpp
+2 -2 flang/test/Lower/OpenACC/acc-kernels-loop.f90
+2 -2 flang/test/Lower/OpenACC/acc-kernels.f90
+2 -2 flang/test/Lower/OpenACC/acc-parallel-loop.f90
+2 -2 flang/test/Lower/OpenACC/acc-parallel.f90
+2 -2 flang/test/Lower/OpenACC/acc-serial-loop.f90
+12 -12, 1 file not shown
+14 -14, 7 files

LLVM/project 1262dce llvm/lib/Target/WebAssembly WebAssemblyInstrRef.td, llvm/test/MC/WebAssembly reference-types.s

Add support for ref.func to AsmParser/MC (#163326)

This is step 1 of exposing WASM `ref.func` to LLVM.

This PR only handles creating the instruction and a test for assembling
it.
Delta File
+9 -0 llvm/test/MC/WebAssembly/reference-types.s
+5 -0 llvm/lib/Target/WebAssembly/WebAssemblyInstrRef.td
+14 -0, 2 files

LLVM/project bd9030e lldb/tools/debugserver/source/MacOSX MachTask.mm MachTask.h

[debugserver] Move constants into TaskPortForProcessID (NFC) (#166670)

I was looking at the calls to `usleep` in debugserver and noticed that
these default arguments are never overwritten. I converted them to
constants in the function, which makes it easier to reason about.
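
The refactoring pattern described above looks roughly like the sketch below. It is not the debugserver code: `acquireWithRetry`, the callback parameter, and the constant values are assumptions chosen for illustration, and `std::this_thread::sleep_for` stands in for `usleep`. Tuning values that no caller overrides move from default arguments into named constants inside the function.

```cpp
#include <cassert>
#include <chrono>
#include <functional>
#include <thread>

// Before (hypothetical): knobs exposed as default arguments that no caller
// ever overrides.
//   int acquireWithRetry(const std::function<bool()> &tryOnce,
//                        int numRetries = 10, int usecInterval = 10000);

// After: the same values as constants local to the function.
// Returns the 1-based attempt that succeeded, or -1 if all attempts failed.
int acquireWithRetry(const std::function<bool()> &tryOnce) {
  constexpr int kNumRetries = 10; // illustrative value
  constexpr auto kRetryInterval = std::chrono::microseconds(10000);
  assert(tryOnce && "caller must supply a callable");
  for (int attempt = 1; attempt <= kNumRetries; ++attempt) {
    if (tryOnce())
      return attempt;
    std::this_thread::sleep_for(kRetryInterval);
  }
  return -1;
}
```

With the constants inside the function, a reader no longer has to check every call site to learn which retry count and sleep interval are actually in effect.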
Delta File
+6 -5 lldb/tools/debugserver/source/MacOSX/MachTask.mm
+1 -3 lldb/tools/debugserver/source/MacOSX/MachTask.h
+7 -8, 2 files