LLVM/project c1cce45 clang/test/Driver hip-temps-linux.hip hip-temps-windows.hip

UNSUPPORTED: system-windows for now
Delta File
+1 -0 clang/test/Driver/hip-temps-linux.hip
+1 -0 clang/test/Driver/hip-temps-windows.hip
+2 -0 2 files

LLVM/project f893f72 clang/test/Driver hip-temps-linux.hip hip-temps-windows.hip

Revert "try unquoting and repeat env"

This reverts commit 9aa8b8e20f9b0912b10194d5aae89c42e4437fae.
Delta File
+1 -1 clang/test/Driver/hip-temps-linux.hip
+1 -1 clang/test/Driver/hip-temps-windows.hip
+2 -2 2 files

LLVM/project 7641ed2 clang/test/Driver hip-temps-linux.hip hip-temps-windows.hip

Revert "try not using \ to break long run lines"

This reverts commit ac138ba8e5a9861e7dda74ee102575f1c5cfeb3b.
Delta File
+4 -1 clang/test/Driver/hip-temps-linux.hip
+4 -1 clang/test/Driver/hip-temps-windows.hip
+8 -2 2 files

LLVM/project 7d9b7e8 llvm/test/CodeGen/PowerPC llvm.sincospi.ppcfp128.ll

PPC: Mark xfailed sincospi test as unsupported with EXPENSIVE_CHECKS (#167639)

Delta File
+1 -0 llvm/test/CodeGen/PowerPC/llvm.sincospi.ppcfp128.ll
+1 -0 1 files

LLVM/project a664cad llvm/lib/CodeGen WasmEHPrepare.cpp

Wasm: Fix hardcoding _Unwind_CallPersonality function name (#167612)

Delta File
+7 -1 llvm/lib/CodeGen/WasmEHPrepare.cpp
+7 -1 1 files

LLVM/project f4ee03f llvm/lib/CodeGen/AsmPrinter CodeViewDebug.cpp CodeViewDebug.h

Store TheCU
Delta File
+6 -11 llvm/lib/CodeGen/AsmPrinter/CodeViewDebug.cpp
+2 -0 llvm/lib/CodeGen/AsmPrinter/CodeViewDebug.h
+8 -11 2 files

LLVM/project ce32b73 orc-rt/include/orc-rt Session.h

Orc rt session wrap unwrap (#167635)

Delta File
+10 -0 orc-rt/include/orc-rt/Session.h
+10 -0 1 files

LLVM/project ce7449e llvm/lib/Target/AMDGPU AMDGPUTargetMachine.cpp AMDGPULowerExecSync.cpp, llvm/test/CodeGen/AMDGPU llc-pipeline.ll

Update description of pass
Delta File
+5 -5 llvm/test/CodeGen/AMDGPU/llc-pipeline.ll
+4 -4 llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
+2 -2 llvm/lib/Target/AMDGPU/AMDGPULowerExecSync.cpp
+11 -11 3 files

LLVM/project ac138ba clang/test/Driver hip-temps-linux.hip hip-temps-windows.hip

try not using \ to break long run lines
Delta File
+1 -4 clang/test/Driver/hip-temps-linux.hip
+1 -4 clang/test/Driver/hip-temps-windows.hip
+2 -8 2 files

LLVM/project f7bca05 llvm/lib/Target/AMDGPU AMDGPULowerModuleLDSPass.cpp

[AMDGPU] Remove lowering named-barrier LDS logic from amdgpu-lower-module-lds
Delta File
+0 -126 llvm/lib/Target/AMDGPU/AMDGPULowerModuleLDSPass.cpp
+0 -126 1 files

LLVM/project e140a57 llvm/test/CodeGen/PowerPC llvm.sincospi.ppcfp128.ll

PPC: Mark xfailed sincospi test as unsupported with EXPENSIVE_CHECKS
Delta File
+1 -0 llvm/test/CodeGen/PowerPC/llvm.sincospi.ppcfp128.ll
+1 -0 1 files

LLVM/project 4f23b5c llvm/lib/Target/AMDGPU AMDGPULowerModuleLDSPass.cpp AMDGPUSwLowerLDS.cpp

remove changes from prior LDS lowering passes
Delta File
+126 -0 llvm/lib/Target/AMDGPU/AMDGPULowerModuleLDSPass.cpp
+2 -2 llvm/lib/Target/AMDGPU/AMDGPUSwLowerLDS.cpp
+128 -2 2 files

LLVM/project 57d8bea llvm/lib/Target/AMDGPU AMDGPUTargetMachine.cpp, llvm/test/CodeGen/AMDGPU amdgpu-lower-exec-sync-and-module-lds.ll amdgpu-lower-special-lds-and-module-lds.ll

update names
Delta File
+122 -0 llvm/test/CodeGen/AMDGPU/amdgpu-lower-exec-sync-and-module-lds.ll
+0 -122 llvm/test/CodeGen/AMDGPU/amdgpu-lower-special-lds-and-module-lds.ll
+73 -0 llvm/test/CodeGen/AMDGPU/amdgpu-lower-exec-sync-and-sw-lds.ll
+0 -73 llvm/test/CodeGen/AMDGPU/amdgpu-lower-special-lds-and-sw-lds.ll
+9 -9 llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
+5 -5 llvm/test/CodeGen/AMDGPU/llc-pipeline.ll
+209 -209 2 files not shown
+213 -213 8 files

LLVM/project ad3a6a5 llvm/test/CodeGen/AMDGPU amdgpu-lower-exec-sync.ll

Update amdgpu-lower-exec-sync.ll test with llc RUN line
Delta File
+1 -0 llvm/test/CodeGen/AMDGPU/amdgpu-lower-exec-sync.ll
+1 -0 1 files

LLVM/project 6ccff9e llvm/lib/Target/AMDGPU AMDGPUSwLowerLDS.cpp, llvm/test/CodeGen/AMDGPU amdgpu-lower-special-lds-and-sw-lds.ll amdgpu-lower-special-lds-and-module-lds.ll

Fix tests
Delta File
+22 -35 llvm/test/CodeGen/AMDGPU/amdgpu-lower-special-lds-and-sw-lds.ll
+2 -3 llvm/lib/Target/AMDGPU/AMDGPUSwLowerLDS.cpp
+3 -0 llvm/test/CodeGen/AMDGPU/amdgpu-lower-special-lds-and-module-lds.ll
+27 -38 3 files

LLVM/project b8ab481 llvm/lib/Target/AMDGPU AMDGPULowerModuleLDSPass.cpp AMDGPUTargetMachine.cpp, llvm/test/CodeGen/AMDGPU amdgpu-lower-special-lds-and-module-lds.ll amdgpu-lower-special-lds-and-sw-lds.ll

[AMDGPU] Enable amdgpu-lower-special-lds pass in pipeline
Delta File
+0 -126 llvm/lib/Target/AMDGPU/AMDGPULowerModuleLDSPass.cpp
+119 -0 llvm/test/CodeGen/AMDGPU/amdgpu-lower-special-lds-and-module-lds.ll
+86 -0 llvm/test/CodeGen/AMDGPU/amdgpu-lower-special-lds-and-sw-lds.ll
+14 -0 llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
+6 -0 llvm/lib/Target/AMDGPU/AMDGPUMemoryUtils.cpp
+3 -3 llvm/test/CodeGen/AMDGPU/llc-pipeline-npm.ll
+228 -129 3 files not shown
+236 -131 9 files

LLVM/project 0eadd18 clang/include/clang/Basic BuiltinsAMDGPU.def, clang/lib/CodeGen/TargetBuiltins AMDGPU.cpp

Review comments: remove the float overload.
Delta File
+48 -48 clang/test/CodeGenOpenCL/builtins-amdgcn.cl
+12 -8 clang/lib/CodeGen/TargetBuiltins/AMDGPU.cpp
+4 -4 clang/include/clang/Basic/BuiltinsAMDGPU.def
+64 -60 3 files

LLVM/project 42368ee llvm/include/llvm/IR IntrinsicsAMDGPU.td, llvm/lib/Target/AMDGPU SIInstructions.td SIISelLowering.cpp

Review comments: remove the `.float` suffix and overload.
Delta File
+130 -96 llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.sub.ll
+129 -77 llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.add.ll
+2 -2 llvm/lib/Target/AMDGPU/SIInstructions.td
+2 -2 llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2 -0 llvm/lib/Target/AMDGPU/AMDGPURegisterBankInfo.cpp
+1 -1 llvm/include/llvm/IR/IntrinsicsAMDGPU.td
+266 -178 6 files

LLVM/project 8b530f5 clang/include/clang/Basic BuiltinsAMDGPU.def, clang/lib/CodeGen/TargetBuiltins AMDGPU.cpp

[AMDGPU] Add builtins for wave reduction intrinsics
Delta File
+84 -0 clang/test/CodeGenOpenCL/builtins-amdgcn.cl
+8 -0 clang/lib/CodeGen/TargetBuiltins/AMDGPU.cpp
+4 -0 clang/include/clang/Basic/BuiltinsAMDGPU.def
+96 -0 3 files

LLVM/project a397e67 llvm/lib/Target/AMDGPU SIISelLowering.cpp SIInstructions.td, llvm/test/CodeGen/AMDGPU llvm.amdgcn.reduce.sub.ll llvm.amdgcn.reduce.add.ll

[AMDGPU] Add wave reduce intrinsics for float types - 2

Supported Ops: `fadd`, `fsub`
Delta File
+967 -0 llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.sub.ll
+949 -0 llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.add.ll
+39 -3 llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2 -0 llvm/lib/Target/AMDGPU/SIInstructions.td
+1,957 -3 4 files

LLVM/project 6823009 llvm/lib/Target/AMDGPU SIISelLowering.cpp

Hardcode quietNaN val.
Delta File
+2 -4 llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2 -4 1 files

LLVM/project e10eb45 llvm/lib/CodeGen/AsmPrinter CodeViewDebug.cpp, llvm/test/DebugInfo/X86 codeview-empty-dbg-cu-crash.ll

Redo this. The compile unit iterator has the unhelpful behavior
of skipping entries that are there.
Delta File
+20 -20 llvm/lib/CodeGen/AsmPrinter/CodeViewDebug.cpp
+4 -2 llvm/test/DebugInfo/X86/codeview-empty-dbg-cu-crash.ll
+24 -22 2 files

LLVM/project aadcb99 llvm/lib/CodeGen/AsmPrinter CodeViewDebug.cpp, llvm/test/DebugInfo/X86 codeview-empty-dbg-cu-crash.ll

Emit 0 instead of incomplete information
Delta File
+10 -10 llvm/lib/CodeGen/AsmPrinter/CodeViewDebug.cpp
+7 -0 llvm/test/DebugInfo/X86/codeview-empty-dbg-cu-crash.ll
+17 -10 2 files

LLVM/project 633af6e llvm/lib/CodeGen/AsmPrinter CodeViewDebug.cpp, llvm/test/DebugInfo/X86 codeview-empty-dbg-cu-crash.ll

CodeGen: Fix CodeView crashes with empty llvm.dbg.cu
Delta File
+30 -0 llvm/test/DebugInfo/X86/codeview-empty-dbg-cu-crash.ll
+15 -6 llvm/lib/CodeGen/AsmPrinter/CodeViewDebug.cpp
+45 -6 2 files

LLVM/project 9aa8b8e clang/test/Driver hip-temps-linux.hip hip-temps-windows.hip

try unquoting and repeat env
Delta File
+1 -1 clang/test/Driver/hip-temps-linux.hip
+1 -1 clang/test/Driver/hip-temps-windows.hip
+2 -2 2 files

LLVM/project 73fa090 clang/test/Driver hip-temps-linux.hip hip-temps-windows.hip

clang: Remove requires system from hip driver tests
Delta File
+3 -4 clang/test/Driver/hip-temps-linux.hip
+3 -4 clang/test/Driver/hip-temps-windows.hip
+6 -8 2 files

LLVM/project 5465979 libunwind/include libunwind.h, libunwind/src libunwind.cpp libunwind_ext.h

libunwind: Implement the unw_strerror function for better nongnu libunwind compatibility (#160887)

As was explained to me in
https://discourse.llvm.org/t/libunwinds-raison-detre/88283/2, the LLVM
version of libunwind is mostly compatible with the nongnu one. This change
improves the compatibility a bit further.
Delta File
+35 -0 libunwind/src/libunwind.cpp
+1 -0 libunwind/include/libunwind.h
+1 -0 libunwind/src/libunwind_ext.h
+37 -0 3 files

LLVM/project adac35f llvm/include/llvm/IR IntrinsicsAMDGPU.td, llvm/lib/Target/AMDGPU SIISelLowering.cpp SIInstructions.td

Review comments: remove `.float` suffix
Delta File
+90 -60 llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.max.ll
+90 -60 llvm/test/CodeGen/AMDGPU/llvm.amdgcn.reduce.min.ll
+2 -3 llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+2 -2 llvm/lib/Target/AMDGPU/SIInstructions.td
+1 -1 llvm/include/llvm/IR/IntrinsicsAMDGPU.td
+2 -0 llvm/lib/Target/AMDGPU/AMDGPURegisterBankInfo.cpp
+187 -126 6 files

LLVM/project c5aace4 llvm/include/llvm/CodeGen SelectionDAG.h TargetLowering.h, llvm/lib/CodeGen/SelectionDAG SelectionDAG.cpp TargetLowering.cpp

DAG: Move expandMultipleResultFPLibCall to TargetLowering (NFC) (#166988)

This kind of helper is higher level and not general enough to go
directly in SelectionDAG. Most similar utilities are in TargetLowering.
Delta File
+0 -161 llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
+161 -0 llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
+0 -10 llvm/include/llvm/CodeGen/SelectionDAG.h
+10 -0 llvm/include/llvm/CodeGen/TargetLowering.h
+2 -2 llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp
+2 -2 llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
+175 -175 1 files not shown
+176 -176 7 files

LLVM/project 1f38d49 llvm/include/llvm/Transforms/Utils ValueMapper.h, llvm/lib/Linker IRMover.cpp

ValueMapper: Delete unused initializers of replaced appending globals.

A full LTO link time performance and memory regression was introduced
by #137081 in cases where the modules contain large quantities of llvm.used
globals. This was unnoticed because it was not expected that this would
be a typical case, but this is exactly what coverage collection does,
and when this feature is enabled together with full LTO we end up with
quadratic memory consumption (from the unused constants) and quadratic
complexity in the function Verifier::visitGlobalValue (which visits all
the unused constants in the use list of each global value). This is a
targeted fix that avoids reintroducing the quadratic complexity from
before #137081, by having ValueMapper delete the old initializer of an
appending global if it is unused, instead of visiting every global in
the context after every link.
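
The growth pattern described above can be illustrated with a toy model. This is a sketch only, not LLVM code: `retained_elements` and its parameters are hypothetical names, and it assumes each link adds exactly one entry to an appending global, whose initializer is then replaced by a new, longer array.

```python
# Toy model (not LLVM code): every link appends one entry to an appending
# global such as llvm.used, replacing its initializer with a longer array.

def retained_elements(num_links: int, delete_unused_old: bool) -> int:
    """Count array elements still alive after num_links link steps."""
    live_arrays = []  # lengths of initializer arrays that are still alive
    current_len = 0
    for _ in range(num_links):
        new_len = current_len + 1
        if delete_unused_old and live_arrays:
            # The replaced initializer has no remaining uses, so drop it.
            live_arrays.pop()
        live_arrays.append(new_len)
        current_len = new_len
    return sum(live_arrays)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, retained_elements(n, False), retained_elements(n, True))
```

Under these assumptions, keeping every old initializer retains n(n+1)/2 elements after n links, while deleting each replaced initializer retains only n.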

The repro-cfi-64 reproducer from #167037 before and after this change:

```
        Elapsed time   Max RSS (KB)

    [11 lines not shown]
Delta File
+15 -9 llvm/lib/Transforms/Utils/ValueMapper.cpp
+1 -4 llvm/lib/Linker/IRMover.cpp
+1 -1 llvm/include/llvm/Transforms/Utils/ValueMapper.h
+17 -14 3 files