LLVM/project 1093a18llvm/lib/Transforms/Vectorize/SandboxVectorizer DependencyGraph.cpp, llvm/unittests/Transforms/Vectorize/SandboxVectorizer DependencyGraphTest.cpp

[SandboxVec][DAG] Handle unscheduled successors when user is external (#183861)

Whenever an IR use-def edge gets updated, the DAG gets notified about
the change by having its `notifySetUse()` callback called. The
callback's job is to update the DAG node's `UnscheduledSuccs` counter
which is the number of successor nodes that are yet to be scheduled.

This update makes sense only if both ends of the use-def edge are in the
DAG. Up until now we would still update the counter even if the user was
outside the DAG. This patch fixes this, so from now on we skip updatinge
`UnscheduledSuccs` if the user is outside the DAG.
DeltaFile
+41-0llvm/unittests/Transforms/Vectorize/SandboxVectorizer/DependencyGraphTest.cpp
+12-2llvm/lib/Transforms/Vectorize/SandboxVectorizer/DependencyGraph.cpp
+53-22 files

LLVM/project eafd076llvm/lib/Target/RISCV RISCVISelDAGToDAG.cpp, llvm/test/CodeGen/RISCV rvp-ext-rv32.ll rvp-ext-rv64.ll

[RISCV][P-ext] Select (splat_vector 0) as copy from X0. (#184911)
DeltaFile
+16-0llvm/lib/Target/RISCV/RISCVISelDAGToDAG.cpp
+1-2llvm/test/CodeGen/RISCV/rvp-ext-rv32.ll
+1-2llvm/test/CodeGen/RISCV/rvp-ext-rv64.ll
+18-43 files

LLVM/project 3015979clang/test/CodeGen/RISCV riscv-inline-asm.c, llvm/lib/Target/RISCV RISCVISelLowering.cpp

[RISCV] Support 'f' Inline Assembly Constraint for bfloat16 (#184566)

This patch is to add 'f' and 'cf' Inline Assembly Constraint for the `bfloat16` type, so they are passed in the floating point registers.
DeltaFile
+45-0llvm/test/CodeGen/RISCV/inline-asm-bf-constraint-f.ll
+4-0llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+4-0clang/test/CodeGen/RISCV/riscv-inline-asm.c
+53-03 files

LLVM/project 4ea72c1lld/ELF Relocations.cpp InputSection.cpp, lld/ELF/Arch RISCV.cpp

[ELF] Add target-specific relocation scanning for RISC-V (#181332)

Implement RISCV::scanSectionImpl, following the pattern established
for x86 (#178846) and AArch64 (#181099). This merges the getRelExpr
and TLS handling for SHF_ALLOC sections into the target-specific
scanner, enabling devirtualization and eliminating abstraction
overhead.

- Inline relocation classification into scanSectionImpl with a switch
  on relocation type, replacing the generic rs.scan() path.
- Use processR_PC/processR_PLT_PC for common PC-relative and PLT
  relocations.
- Handle TLS IE and GD directly (RISC-V does not optimize GD/LD/IE).
- Replace TLS-optimization-specific expressions for TLSDESC, following
  the x86 pattern: R_RELAX_TLS_GD_TO_IE -> R_GOT_PC,
  R_RELAX_TLS_GD_TO_LE -> R_TPREL. Update relocateAlloc and relax()
  to dispatch on relocation type instead of RelExpr for TLSDESC.
- Simplify getRelExpr to only handle relocations needed by
  relocateNonAlloc and preprocessRelocs.

    [4 lines not shown]
DeltaFile
+185-94lld/ELF/Arch/RISCV.cpp
+36-6lld/test/ELF/riscv-vendor-relocations.s
+32-0lld/test/ELF/riscv-vendor-relocations2.test
+6-21lld/ELF/Relocations.cpp
+2-2lld/test/ELF/riscv-reloc-leb128.s
+1-1lld/ELF/InputSection.cpp
+262-1246 files

LLVM/project f7ca74fllvm/lib/Target/RISCV RISCVInstrInfoV.td, llvm/test/MC/RISCV/rvv zvlsseg-invalid.s

[RISCV] Add register overlap checks to the assembler for vector indexed segment load (#184569)

The destination vector register group cannot overlap the source vector
register group for vector indexed segment load. This patch is to add
register overlap checks to the assembler.
DeltaFile
+66-0llvm/test/MC/RISCV/rvv/zvlsseg-invalid.s
+4-0llvm/lib/Target/RISCV/RISCVInstrInfoV.td
+70-02 files

LLVM/project 76ffbc7llvm/lib/Target/AMDGPU SIRegisterInfo.cpp, llvm/test/CodeGen/AMDGPU vgpr-spill.mir

Review comments
DeltaFile
+8-7llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
+4-4llvm/test/CodeGen/AMDGPU/vgpr-spill.mir
+12-112 files

LLVM/project f712c97clang/include/clang/DependencyScanning ModuleDepCollector.h, clang/lib/DependencyScanning ModuleDepCollector.cpp

[clang][deps] Store `IgnoreCWD` on `ModuleDeps` (#184921)

This aligns us with downstream, where we need to be able to query
whether a module depends on CWD or not.
DeltaFile
+6-5clang/lib/DependencyScanning/ModuleDepCollector.cpp
+4-1clang/include/clang/DependencyScanning/ModuleDepCollector.h
+10-62 files

LLVM/project eaae8e2llvm/lib/Target/RISCV RISCVISelLowering.cpp

[RISCV] Remove outdated TODO in isExtractSubvectorCheap (#184938)

Index 0 is already handled by an early return, so the TODO comment about
extracting index 0 from a mask vector is no longer needed.
DeltaFile
+0-1llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+0-11 files

LLVM/project 0538d0aclang/lib/Basic/Targets NVPTX.h, clang/test/Misc nvptx.unsupported_core.cl nvptx.languageOptsOpenCL.cl

[NVPTX] Enable OpenCL 3d_image_writes support (#143331)

NV supports opencl_3d_image_writes according
https://developer.nvidia.com/blog/nvidia-is-now-opencl-3-0-conformant/

This PR allows removing explicit enabling of image extensions via
-cl-ext command line option, e.g. at
https://github.com/intel/llvm/blob/43b3d42e2b2060e9e9e3a96469a1982dc4c10ddd/libclc/CMakeLists.txt#L503
DeltaFile
+0-7clang/test/Misc/nvptx.unsupported_core.cl
+2-3clang/test/Misc/nvptx.languageOptsOpenCL.cl
+4-0clang/lib/Basic/Targets/NVPTX.h
+6-103 files

LLVM/project bcb85e3llvm/test/CodeGen/AArch64 clmul-fixed.ll, llvm/test/CodeGen/RISCV/rvv clmulh-sdnode.ll clmul-sdnode.ll

Merge branch 'main' into users/c8ef/fold_left_first
DeltaFile
+53,024-7,001llvm/test/CodeGen/RISCV/rvv/clmulh-sdnode.ll
+15,172-1,553llvm/test/CodeGen/RISCV/rvv/clmul-sdnode.ll
+6,520-0llvm/test/CodeGen/X86/bit-manip-i512.ll
+3,717-56llvm/test/CodeGen/AArch64/clmul-fixed.ll
+3,441-0llvm/test/MC/AMDGPU/gfx13_asm_vflat.s
+3,257-0llvm/test/CodeGen/X86/bit-manip-i256.ll
+85,131-8,6101,222 files not shown
+123,904-21,9951,228 files

LLVM/project 8acbd0clibcxx/test/libcxx/strings/basic.string/string.cons constexpr_initialization_stress.pass.cpp

5000

Created using spr 1.3.7
DeltaFile
+1-1libcxx/test/libcxx/strings/basic.string/string.cons/constexpr_initialization_stress.pass.cpp
+1-11 files

LLVM/project 61ead49clang/lib/CIR/CodeGen CIRGenBuiltinAArch64.cpp, clang/test/AST/HLSL Texture2D-vector-AST.hlsl Texture2D-scalar-AST.hlsl

Merge branch 'main' into users/vitalybuka/spr/libcxxstring-replace-asan-volatile-wrapper-with-memory-barrier
DeltaFile
+754-88clang/lib/CIR/CodeGen/CIRGenBuiltinAArch64.cpp
+726-0clang/test/AST/HLSL/Texture2D-vector-AST.hlsl
+722-0clang/test/AST/HLSL/Texture2D-scalar-AST.hlsl
+560-0clang/test/SemaSYCL/sycl-kernel-launch.cpp
+364-111llvm/test/CodeGen/AMDGPU/llvm.fptrunc.round.ll
+0-439clang/test/AST/HLSL/Texture2D-AST.hlsl
+3,126-638256 files not shown
+9,480-2,995262 files

LLVM/project dd79c92lld/ELF RelocScan.h, lld/ELF/Arch ARM.cpp

[ELF] handleTlsGd: support disabling GD-to-IE/LE optimization. NFC (#184934)

Use this in ARM::scanSectionImpl for R_ARM_TLS_GD32 and the upcoming
RISC-V change.
DeltaFile
+15-14lld/ELF/RelocScan.h
+1-2lld/ELF/Arch/ARM.cpp
+16-162 files

LLVM/project 1f84b73llvm/lib/Target/RISCV RISCVInstrInfoXSpacemiT.td RISCVInstrFormatsSpacemitV.td, llvm/lib/Target/RISCV/AsmParser RISCVAsmParser.cpp

[RISCV] Add contraints for SpacemiT X60 AI VDot Insts (#174364)

Spec: https://developer.spacemit.com/documentation?token=BWbGwbx7liGW21kq9lucSA6Vnpb#2.1
DeltaFile
+77-79llvm/test/MC/RISCV/xsmtvdot-valid.s
+32-124llvm/lib/Target/RISCV/RISCVInstrInfoXSpacemiT.td
+103-0llvm/lib/Target/RISCV/RISCVInstrFormatsSpacemitV.td
+30-45llvm/test/MC/RISCV/xsmtvdot-invalid.s
+28-8llvm/lib/Target/RISCV/AsmParser/RISCVAsmParser.cpp
+270-2565 files

LLVM/project f409804utils/bazel/llvm-project-overlay/clang-tools-extra/clang-tidy BUILD.bazel

[Bazel] Fix build for #184030

hicpp needed a new dependency.
DeltaFile
+1-0utils/bazel/llvm-project-overlay/clang-tools-extra/clang-tidy/BUILD.bazel
+1-01 files

LLVM/project b0ecd04llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp

Change old code

Change-Id: I26cff6c0c5743684778f022b264c9930eeff24ce
DeltaFile
+4-2llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+4-21 files

LLVM/project 8287e00llvm/test/Instrumentation/AddressSanitizer dynamic-shadow-windows.ll

Add instrumentation test for Windows dynamic shadow memory address on x64 and arm64
DeltaFile
+20-0llvm/test/Instrumentation/AddressSanitizer/dynamic-shadow-windows.ll
+20-01 files

LLVM/project 850a54fclang/lib/Sema TreeTransform.h

[NFC] Add template disambiguation for dependent call to getAttr. (#184928)

This fixes buildbot failures for darwin targets triggered by #152403. 
- https://lab.llvm.org/buildbot/#/builders/190/builds/37864
- https://lab.llvm.org/buildbot/#/builders/23/builds/18339
DeltaFile
+1-1clang/lib/Sema/TreeTransform.h
+1-11 files

LLVM/project e9606dfllvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp AMDGPUCoExecSchedStrategy.h, llvm/test/CodeGen/AMDGPU coexec-scheduler.ll coexec-sched-effective-stall.mir

[AMDGPU] Add HWUI pressure heuristics to coexec strategy

Change-Id: I322cc670c8d923a6df23588d8a14cdaec1f49da9
DeltaFile
+601-0llvm/test/CodeGen/AMDGPU/coexec-scheduler.ll
+413-22llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+284-2llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.h
+4-4llvm/test/CodeGen/AMDGPU/coexec-sched-effective-stall.mir
+1,302-284 files

LLVM/project a500d65mlir/lib/Dialect/OpenACC/IR OpenACCCG.cpp

[MLIR][ACC] Fix -Wunused-variable in dfe7738

There was a variable only used in an assertion which causes warnings in
non-assertions builds. Fix the issue by inlining the variable into the
assertion given it is only used in one place.
DeltaFile
+1-2mlir/lib/Dialect/OpenACC/IR/OpenACCCG.cpp
+1-21 files

LLVM/project 026b90flibcxx/include string

no dups

Created using spr 1.3.7
DeltaFile
+0-2libcxx/include/string
+0-21 files

LLVM/project 125519eclang/lib/CIR/CodeGen CIRGenBuiltinAArch64.cpp, clang/test/AST/HLSL Texture2D-vector-AST.hlsl Texture2D-scalar-AST.hlsl

no explicit limits

Created using spr 1.3.7
DeltaFile
+754-88clang/lib/CIR/CodeGen/CIRGenBuiltinAArch64.cpp
+726-0clang/test/AST/HLSL/Texture2D-vector-AST.hlsl
+722-0clang/test/AST/HLSL/Texture2D-scalar-AST.hlsl
+560-0clang/test/SemaSYCL/sycl-kernel-launch.cpp
+364-111llvm/test/CodeGen/AMDGPU/llvm.fptrunc.round.ll
+0-439clang/test/AST/HLSL/Texture2D-AST.hlsl
+3,126-638224 files not shown
+8,782-2,526230 files

LLVM/project 283116eclang/lib/CIR/CodeGen CIRGenBuiltinAArch64.cpp, clang/test/AST/HLSL Texture2D-vector-AST.hlsl Texture2D-scalar-AST.hlsl

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
DeltaFile
+754-88clang/lib/CIR/CodeGen/CIRGenBuiltinAArch64.cpp
+726-0clang/test/AST/HLSL/Texture2D-vector-AST.hlsl
+722-0clang/test/AST/HLSL/Texture2D-scalar-AST.hlsl
+560-0clang/test/SemaSYCL/sycl-kernel-launch.cpp
+364-111llvm/test/CodeGen/AMDGPU/llvm.fptrunc.round.ll
+400-39clang/test/CodeGen/AArch64/neon/intrinsics.c
+3,526-238224 files not shown
+8,782-2,526230 files

LLVM/project 2d67bf9clang/lib/CIR/CodeGen CIRGenBuiltinAArch64.cpp, clang/test/AST/HLSL Texture2D-vector-AST.hlsl Texture2D-scalar-AST.hlsl

no explicit limits

Created using spr 1.3.7
DeltaFile
+754-88clang/lib/CIR/CodeGen/CIRGenBuiltinAArch64.cpp
+726-0clang/test/AST/HLSL/Texture2D-vector-AST.hlsl
+722-0clang/test/AST/HLSL/Texture2D-scalar-AST.hlsl
+560-0clang/test/SemaSYCL/sycl-kernel-launch.cpp
+364-111llvm/test/CodeGen/AMDGPU/llvm.fptrunc.round.ll
+0-439clang/test/AST/HLSL/Texture2D-AST.hlsl
+3,126-638224 files not shown
+8,782-2,526230 files

LLVM/project b57d62cclang/include/clang/AST ASTStructuralEquivalence.h

[Clang][NFC] Apply Rule of Three to AttrScopedAttrEquivalenceContext (#184905)

Static analysis flagged AttrScopedAttrEquivalenceContext as having a
user defined destructor but not having copy ctor or copy assignment. I
set them as deleted since they are not needed.
DeltaFile
+4-0clang/include/clang/AST/ASTStructuralEquivalence.h
+4-01 files

LLVM/project 08f6fedclang/test/OpenMP loop_collapse_codegen.cpp, llvm/test/CodeGen/AArch64 clmul-fixed.ll

rebase

Created using spr 1.3.7
DeltaFile
+6,520-0llvm/test/CodeGen/X86/bit-manip-i512.ll
+3,707-46llvm/test/CodeGen/AArch64/clmul-fixed.ll
+3,441-0llvm/test/MC/AMDGPU/gfx13_asm_vflat.s
+3,257-0llvm/test/CodeGen/X86/bit-manip-i256.ll
+857-0clang/test/OpenMP/loop_collapse_codegen.cpp
+706-0llvm/test/Transforms/LoopVectorize/float-induction.ll
+18,488-46437 files not shown
+31,961-4,273443 files

LLVM/project cc2c0f9clang/test/OpenMP loop_collapse_codegen.cpp, llvm/test/CodeGen/AArch64 clmul-fixed.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
DeltaFile
+6,520-0llvm/test/CodeGen/X86/bit-manip-i512.ll
+3,707-46llvm/test/CodeGen/AArch64/clmul-fixed.ll
+3,441-0llvm/test/MC/AMDGPU/gfx13_asm_vflat.s
+3,257-0llvm/test/CodeGen/X86/bit-manip-i256.ll
+857-0clang/test/OpenMP/loop_collapse_codegen.cpp
+706-0llvm/test/Transforms/LoopVectorize/float-induction.ll
+18,488-46436 files not shown
+31,959-4,273442 files

LLVM/project 68708c6clang/lib/Lex HeaderSearch.cpp ModuleMap.cpp, clang/test/Modules deprecated-upwards-relative-path.m

rebase

Created using spr 1.3.7
DeltaFile
+22-82lldb/test/API/python_api/run_locker/TestRunLocker.py
+91-0clang/test/SemaHLSL/Types/BuiltinMatrix/MatrixFloatPrecisionWarnings.hlsl
+81-0clang/test/Modules/deprecated-upwards-relative-path.m
+46-27clang/lib/Lex/HeaderSearch.cpp
+36-12clang/lib/Lex/ModuleMap.cpp
+0-44libcxx/test/std/strings/basic.string/string.cons/constexpr_initialization_stress.pass.cpp
+276-16520 files not shown
+450-24426 files

LLVM/project cf8004bllvm/include/llvm/ExecutionEngine/Orc WaitingOnGraph.h

[ORC] Remove accidental #include left in 7f6ada93f72. NFC.

left in the final commit.
DeltaFile
+0-2llvm/include/llvm/ExecutionEngine/Orc/WaitingOnGraph.h
+0-21 files

LLVM/project a69df50llvm/test/CodeGen/AMDGPU call-argument-types.ll amdgcn.bitcast.768bit.ll

[AMDGPU] Regenerated tests after rebasing on top of
users/dhruvachak/add_physical_to_gcn_trackers_after_rename.
DeltaFile
+284-572llvm/test/CodeGen/AMDGPU/call-argument-types.ll
+227-225llvm/test/CodeGen/AMDGPU/amdgcn.bitcast.768bit.ll
+180-196llvm/test/CodeGen/AMDGPU/gfx-callable-return-types.ll
+171-143llvm/test/CodeGen/AMDGPU/agpr-copy-no-free-registers.ll
+131-80llvm/test/CodeGen/AMDGPU/bf16.ll
+88-94llvm/test/CodeGen/AMDGPU/a-v-flat-atomicrmw.ll
+1,081-1,3109 files not shown
+1,477-1,70115 files