1,061,015 commits found in 154 milliseconds
LLVM /project 7bb5287 — llvm/test/CodeGen/AMDGPU minimumnum.bf16.ll maximumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll rebase
Created using spr 1.3.7
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,937 files not shown +181,759 -66,566 1,943 files
LLVM /project 7a9558d — llvm/test/CodeGen/AMDGPU minimumnum.bf16.ll maximumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll [𝘀𝗽𝗿] changes introduced through rebase
Created using spr 1.3.7
[skip ci]
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,937 files not shown +181,759 -66,566 1,943 files
LLVM /project 353a4e3 — llvm/test/CodeGen/AMDGPU minimumnum.bf16.ll maximumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll rebase
Created using spr 1.3.7
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,893 files not shown +181,521 -66,215 1,899 files
LLVM /project ea663af — llvm/test/CodeGen/AMDGPU minimumnum.bf16.ll maximumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll [𝘀𝗽𝗿] changes introduced through rebase
Created using spr 1.3.7
[skip ci]
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,893 files not shown +181,521 -66,215 1,899 files
LLVM /project a45b6b0 — llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll rebase
Created using spr 1.3.7
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,896 files not shown +181,885 -66,320 1,902 files
LLVM /project 77a2da1 — llvm/test/CodeGen/AMDGPU minimumnum.bf16.ll maximumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll [𝘀𝗽𝗿] changes introduced through rebase
Created using spr 1.3.7
[skip ci]
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,896 files not shown +181,885 -66,320 1,902 files
LLVM /project a218593 — llvm/test/CodeGen/AMDGPU maximumnum.bf16.ll minimumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll rebase
Created using spr 1.3.7
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,894 files not shown +181,522 -66,279 1,900 files
LLVM /project 8863fd3 — llvm/test/CodeGen/AMDGPU minimumnum.bf16.ll maximumnum.bf16.ll, llvm/test/CodeGen/AMDGPU/GlobalISel insertelement.ll [𝘀𝗽𝗿] changes introduced through rebase
Created using spr 1.3.7
[skip ci]
Delta File +4,582 -5,914 llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll +6,877 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-sve-instructions.s +3,326 -2,794 llvm/test/CodeGen/AMDGPU/minimumnum.bf16.ll +3,326 -2,794 llvm/test/CodeGen/AMDGPU/maximumnum.bf16.ll +5,336 -0 llvm/test/tools/llvm-mca/AArch64/Cortex/C1Ultra-writeback.s +2,843 -799 llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.ll +26,290 -12,301 1,894 files not shown +181,522 -66,279 1,900 files
LLVM /project 6643213 — mlir/lib/Dialect/SPIRV/IR SPIRVOps.cpp, mlir/test/Dialect/SPIRV/IR structure-ops.mlir [mlir][SPIR-V] Reject initializer on Import-linkage GlobalVariable (#192302)
Per the SPIR-V spec, a module-scope OpVariable with Import linkage must
not have an initializer LLVM /project 47f1c1a — clang/include/clang/ScalableStaticAnalysisFramework/Analyses/EntityPointerLevel EntityPointerLevel.h, clang/lib/ScalableStaticAnalysisFramework/Analyses SSAFAnalysesCommon.cpp SSAFAnalysesCommon.h clean up
LLVM /project 90f53aa — llvm/lib/Target/AMDGPU SIInstructions.td, llvm/test/CodeGen/AMDGPU add.v2i16.ll sub.v2i16.ll [AMDGPU] Add true16 patterns for build_vector (vgpr, 0)
It is shorter than VOP3 and instruction and in some cases
can save a second move.
[AMDGPU] Enable true16 pattern to build vectors (0, vgpr) (#191896)
Fixes: https://github.com/llvm/llvm-project/issues/190796 LLVM /project ae4a02f — llvm/lib/Target/AMDGPU AMDGPURegBankLegalizeRules.cpp, llvm/test/CodeGen/AMDGPU asyncmark-gfx12plus.ll llvm.amdgcn.global.load.async.to.lds.ll AMDGPU/GlobalISel: RegBankLegalize rules for async LDS loads (#192179)
Add RegBankLegalize rules for async LDS load intrinsics:
llvm.amdgcn.asyncmark
llvm.amdgcn.wait.asyncmark
llvm.amdgcn.global.load.async.to.lds.b8
llvm.amdgcn.global.load.async.to.lds.b32
llvm.amdgcn.global.load.async.to.lds.b64
llvm.amdgcn.global.load.async.to.lds.b128 LLVM /project 5c92853 — clang/include/clang/Serialization InMemoryModuleCache.h, clang/lib/Frontend CompilerInstance.cpp [clang] Store size & mtime in in-memory module cache (#190207)
In this PR, the in-memory module cache now stores the size and
modification time of PCM files. This is needed so that the
`ModuleManager` doesn't need to consult the file system to obtain this
information, which _might_ be in a different state than when we stored
the PCM file buffer into the in-memory cache. LLVM /project 1202ddc — llvm/lib/Target/AMDGPU AMDGPUSearchableTables.td, llvm/test/Analysis/UniformityAnalysis/AMDGPU always_uniform.ll [AMDGPU] Mark s_get_*_barrier_state intrinsics always uniform (#192190)
Both intrinsics return a 32-bit SGPR value containing the barrier's
member count and signal count. [llvm][DebugInfo] Use formatv in DWARFAcceleratorTable (#191981)
This relates to #35980. LLVM /project 957f6f6 — llvm/lib/Target/AMDGPU AMDGPURegBankLegalizeHelper.cpp AMDGPURegBankLegalizeRules.cpp, llvm/test/CodeGen/AMDGPU llvm.amdgcn.s.buffer.prefetch.data.ll AMDGPU/GlobalISel: RegBankLegalize rules for G_AMDGPU_S_BUFFER_PREFETCH (#191315) [compiler-rt][Fuchsia] Use dynamic shadow global in hwasan runtime (#192148)
For now, the global is still default initialized to zero. LLVM /project 35dcb5c — clang/lib/CIR/CodeGen CIRGenDecl.cpp CIRGenFunction.h, clang/test/CIR/CodeGen cleanup-automatic-eh.cpp [CIR] Add EH handling for lifetime extended cleanups (#192305)
This adds code to call pushDestroyAndDeferDeactivation from the
pushLifetimeExtendedDestroy function. This was needed to generate the
correct code for lifetime extended cleanups when exceptions are enabled.
An extended version of the cleanup with automatic storage duration is
used as a test case.
To make this work correctly, I had to add a CleanupDeactivationScope to
RunCleanupsScope and force deactivation when forceCleanup is called.
This matches the corresponding code in classic codegen.
I surveyed other places where classic codegen is using
CleanupDeactivationScope and added a MissingFeatures marker in one
location where it was not previously marked. Other places where it was
missing were already marked in this way. LLVM /project e540f80 — flang/include/flang/Optimizer/Dialect FIROps.td, flang/lib/Optimizer/Dialect FIROps.cpp [flang] implements a rewrite pattern to constant fold fir::BoxEleSizeOp (#192320)
Implements a rewrite pattern to constant fold an `fir::BoxEleSizeOp`
when possible. LLVM /project 89e736d — flang/lib/Optimizer/Analysis AliasAnalysis.cpp, flang/test/Analysis/AliasAnalysis modref-call-memory-effects.fir [flang][test] Experimental support of MemoryEffectOpInterface for fir.call. (#191580)
I would like to experiment with `fir.call` implementing
`MemoryEffectOpInterface`. So the main change is the fall-through
path in FIR AA. It should be NFC for Flang. LLVM /project 25ccdfa — clang/lib/ScalableStaticAnalysisFramework/Analyses SSAFAnalysesCommon.h fix build issue
LLVM /project bae2ead — clang/lib/ScalableStaticAnalysisFramework/Analyses SSAFAnalysesCommon.cpp fix merge issues
concise findCXXNewExpr
[CUDA] Change __CUDACC__ definition to 1 (#189457)
I recently encountered an issue where `nccl` used `#if __CUDACC__` ,
assuming `__CUDACC__` is not only defined but having a #if-able value.
https://github.com/NVIDIA/nccl/blob/v2.28.3-1/src/include/nccl_device/coop.h#L18
Looking at nvcc invocation, I see that:
```
echo "" | nvcc -x cu -E -Xcompiler -dM - | grep __CUDACC__
#define __CUDACC__ 1
```
Changing __CUDACC__ to 1 to match what NVIDIA downstream code
assumptions. format
LLVM /project e9c0b9c — clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvzip/policy/non-overloaded vpaire.c vpairo.c, clang/test/CodeGen/RISCV/rvv-intrinsics-autogenerated/zvzip/policy/overloaded vpairo.c Merge branch 'users/ziqingluo/PR-172429193 -2-split-2' into users/ziqingluo/PR-172429193 -2-split-3
Conflicts:
clang/lib/ScalableStaticAnalysisFramework/Analyses/UnsafeBufferUsage/UnsafeBufferUsageExtractor.cpp
add docs
Add a comment
Change-Id: I447f7f1fb185b18924cfd98249b5a0a05fef2484
LLVM /project e7db400 — llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.h AMDGPUCoExecSchedStrategy.cpp Formatting
Change-Id: I3d89fba145471141ef945b1de15330caa245e82d