LLVM/project 998b28flldb/packages/Python/lldbsuite/test/tools/lldb-dap dap_server.py, lldb/test/API/tools/lldb-dap/server TestDAP_server.py

[lldb-dap] Refactoring lldb-dap port listening mode to allow multiple connections. (#116392)

This adjusts the lldb-dap listening mode to accept multiple clients.
Each client initializes a new instance of DAP and an associated
`lldb::SBDebugger` instance.

The listening mode is configured with the `--connection` option and
supports listening on a port or a unix socket on supported platforms.

When running in server mode launch and attach performance should
be improved by lldb sharing symbols for core libraries between debug
sessions.
DeltaFile
+244-150lldb/tools/lldb-dap/lldb-dap.cpp
+107-0lldb/test/API/tools/lldb-dap/server/TestDAP_server.py
+75-27lldb/packages/Python/lldbsuite/test/tools/lldb-dap/dap_server.py
+66-9lldb/tools/lldb-dap/DAP.cpp
+13-9lldb/tools/lldb-dap/Options.td
+15-3lldb/tools/lldb-dap/DAP.h
+520-1986 files not shown
+544-20512 files

LLVM/project 1bb4306llvm/test/CodeGen/AMDGPU llvm.amdgcn.sched.group.barrier.ll shufflevector.v2i64.v8i64.ll, llvm/test/CodeGen/Thumb2 mve-vst3.ll

PeepholeOpt: Allow introducing subregister uses on reg_sequence (#127052)

This reverts d246cc618adc52fdbd69d44a2a375c8af97b6106. We now handle
composing subregister extracts through reg_sequence.
DeltaFile
+444-444llvm/test/CodeGen/AMDGPU/llvm.amdgcn.sched.group.barrier.ll
+304-304llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+171-298llvm/test/CodeGen/AMDGPU/identical-subrange-spill-infloop.ll
+220-215llvm/test/CodeGen/Thumb2/mve-vst3.ll
+157-211llvm/test/CodeGen/AMDGPU/load-global-i32.ll
+178-178llvm/test/CodeGen/AMDGPU/fptoi.i128.ll
+1,474-1,65055 files not shown
+3,712-3,98161 files

LLVM/project 93b2e47flang/lib/Optimizer/Transforms CUFOpConversion.cpp, flang/test/Fir/CUDA cuda-data-transfer.fir

[flang][cuda] Avoid assign element mismatch when doing data transfer from a constant (#128252)

Currently when we do a CUDA data transfer from a constant, we embox it
and delegate the assignment to the runtime. When the type of the
constant is not exactly the same as the destination descriptor, the
runtime will emit an assignment mismatch error.

Convert the constant when necessary so the assignment is fine.
DeltaFile
+19-7flang/lib/Optimizer/Transforms/CUFOpConversion.cpp
+22-0flang/test/Fir/CUDA/cuda-data-transfer.fir
+41-72 files

LLVM/project 5ff934ellvm/include/llvm/CodeGen AsmPrinter.h, llvm/lib/CodeGen/AsmPrinter AsmPrinter.cpp

fmt

Created using spr 1.3.4
DeltaFile
+2-2llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
+2-2llvm/lib/Target/AArch64/AArch64AsmPrinter.cpp
+0-1llvm/include/llvm/CodeGen/AsmPrinter.h
+4-53 files

LLVM/project 700b66dclang/test/CodeGen memtag-globals-asm.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+11-0clang/test/CodeGen/memtag-globals-asm.cpp
+11-01 files

LLVM/project 1df09beclang/test/CodeGen memtag-globals-asm.cpp

fix

Created using spr 1.3.4
DeltaFile
+5-5clang/test/CodeGen/memtag-globals-asm.cpp
+5-51 files

LLVM/project 63c0a82clang/test/CodeGen memtag-globals-asm.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+11-0clang/test/CodeGen/memtag-globals-asm.cpp
+11-01 files

LLVM/project ea09024clang/test/CodeGen memtag-globals-asm.cpp

fmt

Created using spr 1.3.4
DeltaFile
+12-10clang/test/CodeGen/memtag-globals-asm.cpp
+12-101 files

LLVM/project 12fc6d0clang/test/CodeGen memtag-globals-asm.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+11-0clang/test/CodeGen/memtag-globals-asm.cpp
+11-01 files

LLVM/project 24945d0clang/test/CodeGen memtag-globals-asm.cpp

add missing

Created using spr 1.3.4
DeltaFile
+1-0clang/test/CodeGen/memtag-globals-asm.cpp
+1-01 files

LLVM/project 1ffdefbclang/test/CodeGen memtag-globals-asm.cpp

rebase

Created using spr 1.3.4
DeltaFile
+10-0clang/test/CodeGen/memtag-globals-asm.cpp
+10-01 files

LLVM/project 64e954bclang/test/CodeGen memtag-globals-asm.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+10-0clang/test/CodeGen/memtag-globals-asm.cpp
+10-01 files

LLVM/project 19e50c7clang/test/CodeGen memtag-globals-asm.cpp

[𝘀𝗽𝗿] initial version

Created using spr 1.3.4
DeltaFile
+10-0clang/test/CodeGen/memtag-globals-asm.cpp
+10-01 files

LLVM/project e45a72allvm/test/CodeGen/AMDGPU shufflevector.v2i64.v8i64.ll shufflevector.v2bf16.v8bf16.ll

refactor

Created using spr 1.3.4
DeltaFile
+30,641-0llvm/test/CodeGen/AMDGPU/shufflevector.v2i64.v8i64.ll
+27,655-0llvm/test/CodeGen/AMDGPU/shufflevector.v2bf16.v8bf16.ll
+27,655-0llvm/test/CodeGen/AMDGPU/shufflevector.v2f16.v8f16.ll
+27,139-0llvm/test/CodeGen/AMDGPU/shufflevector.v2i16.v8i16.ll
+25,933-0llvm/test/CodeGen/AMDGPU/shufflevector.v2p3.v8p3.ll
+25,933-0llvm/test/CodeGen/AMDGPU/shufflevector.v2f32.v8f32.ll
+164,956-015,561 files not shown
+1,914,489-576,76815,567 files

LLVM/project 5f8da7ellvm/lib/Analysis DependenceAnalysis.cpp, llvm/test/Analysis/DependenceAnalysis PR51512.ll

[DA] remove wrap-around check from affine definition (#116632)

Fix https://github.com/llvm/llvm-project/issues/51512
by reverting a part of commit
https://github.com/llvm/llvm-project/commit/c0661aeaf8daf371023cf5669be4bd9b428882d0
that modified the definition affine subscripts.
DeltaFile
+44-0llvm/test/Analysis/DependenceAnalysis/PR51512.ll
+0-8llvm/lib/Analysis/DependenceAnalysis.cpp
+44-82 files

LLVM/project d175aedmlir/include/mlir/Dialect/Tosa/IR TosaProfileCompliance.h

[Tosa] Add necessary `unordered_map` header file (#128248)

Ref:
https://github.com/llvm/llvm-project/pull/126992#discussion_r1966320748
DeltaFile
+2-0mlir/include/mlir/Dialect/Tosa/IR/TosaProfileCompliance.h
+2-01 files

LLVM/project 7ae2b1ebolt/lib/Core BinaryFunction.cpp, bolt/test/X86 entry-point-fallthru.s

[𝘀𝗽𝗿] initial version

Created using spr 1.3.4
DeltaFile
+24-0bolt/test/X86/entry-point-fallthru.s
+3-2bolt/lib/Core/BinaryFunction.cpp
+27-22 files

LLVM/project 7cfab6cclang/lib/CodeGen HLSLBufferLayoutBuilder.cpp

[HLSL] add parentheses in assertion to eliminate -Wparentheses warning (#128246)

Fixes #128210
DeltaFile
+3-3clang/lib/CodeGen/HLSLBufferLayoutBuilder.cpp
+3-31 files

LLVM/project aef63c5llvm/lib/Target/RISCV RISCVInstrInfoXRivos.td RISCVFeatures.td, llvm/lib/Target/RISCV/Disassembler RISCVDisassembler.cpp

[RISCV] Assembler support for XRivosVizip (#127694)

This implements assembler support for the XRivosVizip custom/vendor
extension from Rivos Inc. which is defined in:
https://github.com/rivosinc/rivos-custom-extensions (See
src/xrivosvizip.adoc)

Codegen support will follow in a separate change.
DeltaFile
+59-0llvm/test/MC/RISCV/xrivosvizip-valid.s
+27-0llvm/lib/Target/RISCV/RISCVInstrInfoXRivos.td
+10-0llvm/test/MC/RISCV/xrivosvizip-invalid.s
+9-0llvm/lib/Target/RISCV/RISCVFeatures.td
+3-0llvm/lib/Target/RISCV/Disassembler/RISCVDisassembler.cpp
+1-0llvm/unittests/TargetParser/RISCVISAInfoTest.cpp
+109-02 files not shown
+111-08 files

LLVM/project 5d0c5c6mlir/lib/Dialect/Arith/IR ArithOps.cpp, mlir/test/Dialect/Arith canonicalize.mlir

[MLIR][ARITH] Adds missing foldings for truncf (#128096)

This patch is mainly to deal with folding `truncf`, as follows:
`truncf(extf(a))` -> `a`, if `a` has the same bitwidth as the result
`truncf(extf(a))` -> `truncf(a)`, if `a` has larger bitwidth than the
result

DeltaFile
+39-0mlir/test/Dialect/Arith/canonicalize.mlir
+21-0mlir/lib/Dialect/Arith/IR/ArithOps.cpp
+60-02 files

LLVM/project 6038fd4flang/include/flang/Optimizer/Builder IntrinsicCall.h, flang/lib/Optimizer/Builder IntrinsicCall.cpp

[flang][cuda] Lower atomiccas, atomicxor and atomicexch (#128242)

Lower atomiccas, atomicxor and atomicexch to corresponding llvm atomic
operations.
DeltaFile
+106-48flang/module/cudadevice.f90
+69-0flang/lib/Optimizer/Builder/IntrinsicCall.cpp
+12-12flang/test/Lower/CUDA/cuda-device-proc.cuf
+4-1flang/include/flang/Optimizer/Builder/IntrinsicCall.h
+191-614 files

LLVM/project 8ffdc3bllvm/lib/Transforms/Vectorize SLPVectorizer.cpp, llvm/test/Transforms/SLPVectorizer/RISCV reordered-buildvector-scalars.ll

[SLP]Fix a crash when checking a scalar in a reordered buildvector node

Need to check reordered scalars, not the original ones, to correctly
check proper scalar.
DeltaFile
+184-0llvm/test/Transforms/SLPVectorizer/RISCV/reordered-buildvector-scalars.ll
+1-1llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
+185-12 files

LLVM/project 92e02adllvm/lib/Transforms/IPO MemProfContextDisambiguation.cpp, llvm/test/ThinLTO/X86 memprof-recursive.ll

[MemProf] Display backedges with dotted line in dot graphs (#128235)

Add checking of this behavior in the postbuild dot graphs, facilitated
by PR128226 which marked these edges at the end of the graph building.
DeltaFile
+9-5llvm/lib/Transforms/IPO/MemProfContextDisambiguation.cpp
+8-0llvm/test/ThinLTO/X86/memprof-recursive.ll
+8-0llvm/test/Transforms/MemProfContextDisambiguation/recursive.ll
+25-53 files

LLVM/project 79261d4llvm/lib/Target/NVPTX NVPTXISelDAGToDAG.cpp NVPTXISelLowering.cpp, llvm/test/CodeGen/NVPTX local-stack-frame.ll variadics-backend.ll

[NVPTX][InferAS] assume alloca instructions are in local AS (#121710)

DeltaFile
+185-43llvm/test/CodeGen/NVPTX/local-stack-frame.ll
+49-47llvm/test/CodeGen/NVPTX/variadics-backend.ll
+21-22llvm/lib/Target/NVPTX/NVPTXISelDAGToDAG.cpp
+16-15llvm/test/CodeGen/NVPTX/lower-args-gridconstant.ll
+18-4llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
+11-9llvm/test/CodeGen/NVPTX/indirect_byval.ll
+300-1405 files not shown
+339-15211 files

LLVM/project ca0406dllvm/docs DeveloperPolicy.rst

DeveloperPolicy: Update commit access requirements (#127006)

See https://discourse.llvm.org/t/rfc-commit-access-criteria/84073
DeltaFile
+13-3llvm/docs/DeveloperPolicy.rst
+13-31 files

LLVM/project 6b3e765libc/config/gpu/amdgpu entrypoints.txt, libc/config/gpu/nvptx entrypoints.txt

[libc] Enable 'strftime' for the GPU targets (#128220)

Summary:
These should allow us to build with locale support for libcxx.
DeltaFile
+2-0libc/config/gpu/amdgpu/entrypoints.txt
+2-0libc/config/gpu/nvptx/entrypoints.txt
+2-0libc/docs/gpu/support.rst
+6-03 files

LLVM/project 7f5a2cbutils/bazel/llvm-project-overlay/libc BUILD.bazel, utils/bazel/llvm-project-overlay/libc/test/src/stdio BUILD.bazel

[libc][bazel] add targets to build the scanf family (#128082)

Now that scanf is a little cleaner, this patch adds rules to build it
via bazel.
DeltaFile
+170-0utils/bazel/llvm-project-overlay/libc/BUILD.bazel
+40-0utils/bazel/llvm-project-overlay/libc/test/src/stdio/BUILD.bazel
+210-02 files

LLVM/project e6d4fd0llvm/lib/Target/AMDGPU SIISelLowering.cpp SIISelLowering.h, llvm/test/CodeGen/AMDGPU fmaximum3.ll fminimum3.ll

AMDGPU: Widen f16 minimum/maximum to v2f16 on gfx950 (#128121)

Unfortunately we only have the vector versions of v2f16 minimum3
and maximum. Widen to v2f16 so we can lower as minimum333(x, y, y).

(cherry picked from commit e729dc759d052de122c8a918fe51b05ac796bb50)
DeltaFile
+434-255llvm/test/CodeGen/AMDGPU/fmaximum3.ll
+434-255llvm/test/CodeGen/AMDGPU/fminimum3.ll
+30-36llvm/test/CodeGen/AMDGPU/llvm.maximum.f16.ll
+30-36llvm/test/CodeGen/AMDGPU/llvm.minimum.f16.ll
+37-3llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+1-0llvm/lib/Target/AMDGPU/SIISelLowering.h
+966-5856 files

LLVM/project e0c4a33clang/lib/Headers gpuintrin.h nvptxintrin.h, libc/test/integration/src/__support/GPU scan_reduce.cpp

[Clang] Fix cross-lane scan when given divergent lanes (#127703)

Summary:
The scan operation implemented here only works if there are contiguous
ones in the executation mask that can be used to propagate the result.
There are two solutions to this, one is to enter 'whole-wave-mode' and
forcibly turn them back on, or to do this serially. This implementation
does the latter because it's more portable, but checks to see if the
parallel fast-path is applicable.

Needs to be backported for correct behavior and because it fixes a
failing libc test.

(cherry picked from commit 6cc7ca084a5bbb7ccf606cab12065604453dde59)
DeltaFile
+49-25clang/lib/Headers/gpuintrin.h
+49-0libc/test/integration/src/__support/GPU/scan_reduce.cpp
+4-1clang/lib/Headers/nvptxintrin.h
+102-263 files

LLVM/project 43a04b1libcxx/include/__locale_dir locale_base_api.h, libcxx/include/__locale_dir/support bsd_like.h windows.h

[libc++] Reduce the dependency of the locale base API on the base system from the headers (#117764)

Many parts of the locale base API are only required when building the
shared/static library, but not from the headers. Document those
functions and carve out a few of those that don't work when
_XOPEN_SOURCE is defined to something old.

Fixes #117630

(cherry picked from commit f00b32e2d0ee666d32f1ddd0c687e269fab95b44)
DeltaFile
+36-20libcxx/include/__locale_dir/locale_base_api.h
+53-0libcxx/test/libcxx/xopen_source.gen.py
+14-8libcxx/include/__locale_dir/support/bsd_like.h
+13-5libcxx/include/__locale_dir/support/windows.h
+6-3libcxx/include/__locale_dir/support/fuchsia.h
+6-2libcxx/include/__locale_dir/support/no_locale/characters.h
+128-386 files