LLVM/project b889df3llvm/lib/Transforms/Instrumentation MemProfUse.cpp, llvm/test/Transforms/PGOProfile memprof-dump-matched-alloc-site.ll

[MemProf] Add option to print function GUIDs during matching (#170946)

For debugging, add -memprof-print-function-guids option that will cause
memprof matching to emit the computed function GUID for every function
definition in the module to stderr. This is helpful because the profile
only contains the function GUID and not the names.
DeltaFile
+8-0llvm/lib/Transforms/Instrumentation/MemProfUse.cpp
+5-2llvm/test/Transforms/PGOProfile/memprof-dump-matched-alloc-site.ll
+13-22 files

LLVM/project 06f9deellvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

Rebase

Created using spr 1.3.6-beta.1
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,628 files not shown
+211,870-242,6594,634 files

LLVM/project 367ebafllvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,626 files not shown
+211,867-242,6564,632 files

LLVM/project e5397f1mlir/lib/Dialect/XeGPU/Transforms XeGPUSubgroupDistribute.cpp, mlir/test/Dialect/XeGPU subgroup-distribute-unit.mlir subgroup-distribute.mlir

add unit tests and fix bugs
DeltaFile
+40-28mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp
+65-0mlir/test/Dialect/XeGPU/subgroup-distribute-unit.mlir
+1-1mlir/test/Dialect/XeGPU/subgroup-distribute.mlir
+106-293 files

LLVM/project 23591c3llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

Rebase

Created using spr 1.3.6-beta.1
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,626 files not shown
+211,867-242,6564,632 files

LLVM/project 9e98712llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,625 files not shown
+211,863-242,6524,631 files

LLVM/project 58c6006llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

Address review comments

Created using spr 1.3.6-beta.1
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,625 files not shown
+211,863-242,6524,631 files

LLVM/project 2ec72dallvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,608 files not shown
+211,746-242,5784,614 files

LLVM/project 3196a54llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

Address review comments

Created using spr 1.3.6-beta.1
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,608 files not shown
+211,746-242,5784,614 files

LLVM/project 75d6068llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,608 files not shown
+211,746-242,5784,614 files

LLVM/project acbd52bllvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

Address review comments

Created using spr 1.3.6-beta.1
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,608 files not shown
+211,746-242,5784,614 files

LLVM/project f96bf3bllvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/X86 wide-scalar-shift-by-byte-multiple-legalization.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+17,522-20,773llvm/test/CodeGen/X86/wide-scalar-shift-by-byte-multiple-legalization.ll
+8,998-11,093llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll
+8,981-11,098llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll
+4,734-0llvm/test/tools/llvm-mca/RISCV/tt-ascalon-d8/vlseg-vsseg.s
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+1,529-1,529llvm/test/tools/llvm-mca/RISCV/SpacemitX60/rvv-fp.s
+46,489-44,4934,605 files not shown
+211,736-242,5644,611 files

LLVM/project 3f3fd2fllvm/lib/Target/AArch64 AArch64AsmPrinter.cpp, llvm/lib/Transforms/Scalar SROA.cpp

[𝘀𝗽𝗿] initial version

Created using spr 1.3.6-beta.1
DeltaFile
+78-5llvm/lib/Transforms/Scalar/SROA.cpp
+73-0llvm/test/Transforms/SROA/protected-field-pointer.ll
+7-2llvm/lib/Target/AArch64/AArch64AsmPrinter.cpp
+6-0llvm/test/CodeGen/AArch64/ptrauth-irelative.ll
+164-74 files

LLVM/project e198097llvm/lib/Transforms/Scalar SROA.cpp, llvm/test/Transforms/SROA protected-field-pointer.ll

[𝘀𝗽𝗿] changes to main this commit is based on

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+78-5llvm/lib/Transforms/Scalar/SROA.cpp
+73-0llvm/test/Transforms/SROA/protected-field-pointer.ll
+151-52 files

LLVM/project 90e3ac6llvm/include/llvm-c Core.h, llvm/include/llvm/IR Instructions.h

Revert "[IR] Don't store switch case values as operands" (#170962)

Reverts llvm/llvm-project#166842

Breaks Mips LLVM tests, and LLD on bots.
See llvm/llvm-project#166842
DeltaFile
+13-22llvm/include/llvm/IR/Instructions.h
+15-18llvm/lib/IR/Instructions.cpp
+0-24llvm/include/llvm-c/Core.h
+0-23llvm/unittests/IR/InstructionsTest.cpp
+10-10llvm/lib/IR/User.cpp
+9-8llvm/lib/Target/SPIRV/SPIRVEmitIntrinsics.cpp
+47-1057 files not shown
+55-14113 files

LLVM/project 72d5fe5clang/lib/Headers cpuid.h

[clang] [doc] Added documentation to intrinsics in cpuid.h (#170507)

Added doxygen comments to 4 intrinsincs in cpuid.h
DeltaFile
+96-0clang/lib/Headers/cpuid.h
+96-01 files

LLVM/project 7e39ae7llvm/lib/Transforms/IPO LowerTypeTests.cpp, llvm/test/Transforms/LowerTypeTests import.ll

[LTT] Add `unknown` branch weights when lowering type tests with conditional
DeltaFile
+13-10llvm/test/Transforms/LowerTypeTests/import.ll
+5-1llvm/lib/Transforms/IPO/LowerTypeTests.cpp
+0-2llvm/utils/profcheck-xfail.txt
+18-133 files

LLVM/project 4294dc7llvm/include/llvm-c Core.h, llvm/include/llvm/IR Instructions.h

Revert "[IR] Don't store switch case values as operands (#166842)"

This reverts commit f26360f2150e7ff916f2aa5d5fe3ff32d1780c8c.
DeltaFile
+13-22llvm/include/llvm/IR/Instructions.h
+15-18llvm/lib/IR/Instructions.cpp
+0-24llvm/include/llvm-c/Core.h
+0-23llvm/unittests/IR/InstructionsTest.cpp
+10-10llvm/lib/IR/User.cpp
+9-8llvm/lib/Target/SPIRV/SPIRVEmitIntrinsics.cpp
+47-1057 files not shown
+55-14113 files

LLVM/project 6a148c5llvm/test/Transforms/LowerTypeTests import.ll

[NFC] Run UTC --check-globals on LowerTypeTests/import.ll
DeltaFile
+154-136llvm/test/Transforms/LowerTypeTests/import.ll
+154-1361 files

LLVM/project 472fd70llvm/utils/git code-lint-helper.py

[Github][CI] Introduce `LintHelper` class in `code-lint-helper.py`  (#168827)

This commit introduces a new `LintHelper` class to
`llvm/utils/git/code-lint-helper.py`, which lays the groundwork for
integrating additional linters (`doc8`) into Github Action workflow.

---------

Co-authored-by: EugeneZelenko <eugene.zelenko at gmail.com>
DeltaFile
+231-195llvm/utils/git/code-lint-helper.py
+231-1951 files

LLVM/project e546d0fllvm/lib/Target/RISCV RISCVTargetTransformInfo.cpp, llvm/test/Analysis/CostModel/RISCV fshl_fshr.ll

[RISCV][TTI] Add cost model for ROTL/ROTR (#170824)

A funnel shift with the same first two operands is a rotate. When
`Zbb/Zbkb` is enabled we can use the `ROL(W)/ROR(I)(W)` instruction to
represent this. Add cost model support for this.

Similar to https://github.com/llvm/llvm-project/pull/169335 for AArch64.
DeltaFile
+181-0llvm/test/Analysis/CostModel/RISCV/fshl_fshr.ll
+17-0llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp
+198-02 files

LLVM/project b1ee4d0llvm/lib/Target/RISCV RISCVMergeBaseOffset.cpp

[RISCV] Refactor some code in RISCVMergeBaseOffset.NFC (#170839)

DeltaFile
+21-17llvm/lib/Target/RISCV/RISCVMergeBaseOffset.cpp
+21-171 files

LLVM/project 6d8714bclang/docs ClangIRCodeDuplication.rst index.rst

[Clang][CIR][Doc] Document CIR code duplication plans (#166457)

This adds a document describing known problems with code duplication in
the CIR codegen implementation, strategies to mitigate the risks caused
by that code duplication, and a general long-term plan for minimizing
the problem.
DeltaFile
+245-0clang/docs/ClangIRCodeDuplication.rst
+1-1clang/docs/index.rst
+246-12 files

LLVM/project ee77c58llvm/lib/Target/AMDGPU AMDGPUPromoteAlloca.cpp, llvm/test/CodeGen/AMDGPU promote-alloca-cfg.ll

Reland "AMDGPU/PromoteAlloca: Simplify how deferred loads work (#170510)" (#170955)

The second pass of promotion to vector can be quite simple. Reflect that
simplicity in the code for better maintainability.

v2:
- don't put placeholders into the SSAUpdater, and add a test that shows
the problem
DeltaFile
+36-46llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
+33-0llvm/test/CodeGen/AMDGPU/promote-alloca-cfg.ll
+69-462 files

LLVM/project 488575alibc/src/__support/RPC rpc_server.h, libc/src/stdio/baremetal vprintf.cpp

[libc] Refactor static polymorphism in WriteBuffer (NFC).

There are three flavors of WriteBuffer currently, all of which
could be passed into printf_core::Writer class. It's a tricky
class, since it chooses a flavor-specific logic either based
on runtime dispatch (to save code size and prevent
generating three versions of the entirety of printf_core),
or based on template arguments (to avoid dealing with
function pointers in codegen for FILL_BUFF_AND_DROP_OVERFLOW path.

Refactor this somewhat convoluted logic to have three
concrete subclasses inheriting from the templated base
class, and use static polymorphism with reinterpret_cast
to implement dispatching above. Now we can actually
have flavor-specific fields, constructors, and methods
(e.g. "flush_to_stream" is now a method of FlushingBuffer),
and the code on the user side is cleaner: the complexity
of enabling/disabling runtime-dispatch and using proper
template arguments is now localized in writer.h.

    [4 lines not shown]
DeltaFile
+98-56libc/src/stdio/printf_core/writer.h
+29-35libc/test/src/stdio/printf_core/writer_test.cpp
+6-10libc/test/src/stdio/printf_core/converter_test.cpp
+3-9libc/src/__support/RPC/rpc_server.h
+4-6libc/src/stdio/printf_core/vasprintf_internal.h
+4-4libc/src/stdio/baremetal/vprintf.cpp
+144-12011 files not shown
+162-15417 files

LLVM/project 67b208ellvm/lib/Target/AMDGPU AMDGPUPromoteAlloca.cpp, llvm/test/CodeGen/AMDGPU promote-alloca-multidim.ll promote-alloca-negative-index.ll

Reland "AMDGPU/PromoteAlloca: Always use i32 for indexing (#170511)"

Create more canonical code that may even lead to slightly better
codegen.

commit-id:a3832fee
DeltaFile
+15-13llvm/test/CodeGen/AMDGPU/promote-alloca-multidim.ll
+7-6llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
+8-4llvm/test/CodeGen/AMDGPU/promote-alloca-negative-index.ll
+1-1llvm/test/CodeGen/AMDGPU/promote-alloca-vector-to-vector.ll
+31-244 files

LLVM/project b2d31cbllvm/lib/Target/AMDGPU AMDGPUPromoteAlloca.cpp, llvm/test/CodeGen/AMDGPU promote-alloca-cfg.ll

Reland "AMDGPU/PromoteAlloca: Simplify how deferred loads work (#170510)"

The second pass of promotion to vector can be quite simple. Reflect that
simplicity in the code for better maintainability.

v2:
- don't put placeholders into the SSAUpdater, and add a test that shows
  the problem

commit-id:d6d2255a
DeltaFile
+36-46llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
+33-0llvm/test/CodeGen/AMDGPU/promote-alloca-cfg.ll
+69-462 files

LLVM/project 492effb.github/workflows release-binaries.yml

Enable attestation creation
DeltaFile
+2-3.github/workflows/release-binaries.yml
+2-31 files

LLVM/project 9e53770clang/include/clang/Analysis/FlowSensitive/Models UncheckedStatusOrAccessModel.h, clang/lib/Analysis/FlowSensitive/Models UncheckedStatusOrAccessModel.cpp

[𝘀𝗽𝗿] initial version

Created using spr 1.3.7
DeltaFile
+621-0clang/unittests/Analysis/FlowSensitive/UncheckedStatusOrAccessModelTestFixture.cpp
+519-0clang/lib/Analysis/FlowSensitive/Models/UncheckedStatusOrAccessModel.cpp
+94-0clang/unittests/Analysis/FlowSensitive/MockHeaders.cpp
+3-1clang/include/clang/Analysis/FlowSensitive/Models/UncheckedStatusOrAccessModel.h
+1,237-14 files

LLVM/project 439e0a6clang/include/clang/Analysis/FlowSensitive/Models UncheckedStatusOrAccessModel.h, clang/lib/Analysis/FlowSensitive/Models UncheckedStatusOrAccessModel.cpp

[𝘀𝗽𝗿] changes to main this commit is based on

Created using spr 1.3.7

[skip ci]
DeltaFile
+621-0clang/unittests/Analysis/FlowSensitive/UncheckedStatusOrAccessModelTestFixture.cpp
+519-0clang/lib/Analysis/FlowSensitive/Models/UncheckedStatusOrAccessModel.cpp
+90-0clang/unittests/Analysis/FlowSensitive/MockHeaders.cpp
+3-1clang/include/clang/Analysis/FlowSensitive/Models/UncheckedStatusOrAccessModel.h
+1,233-14 files