LLVM/project 85c7a7fllvm/lib/MC DXContainerRootSignature.cpp

[DirectX] Don't byte-swap returned byte-offset (#155860)

- The returned byte offset from `rewriteOffsetToCurrentByte` should not
be byte-swapped as it will be compared and interpreted as a uint32_t in
its uses

This commit corrects build failures that hit an assert on big-endian
builds
DeltaFile
+3-2llvm/lib/MC/DXContainerRootSignature.cpp
+3-21 files

LLVM/project 4a7205flibclc CMakeLists.txt

libclc: CMake: include GetClangResourceDir (#155836)

`get_clang_resource_dir` is not guarantee to be there. Make sure of it
by including `GetClangResourceDir`.
DeltaFile
+1-0libclc/CMakeLists.txt
+1-01 files

LLVM/project 403986ellvm/lib/Target/AMDGPU/MCTargetDesc AMDGPUInstPrinter.cpp AMDGPUInstPrinter.h

[AMDGPU] Common up code from AMDGPUInstPrinter::printImmediate64. NFC. (#155882)

Introduce a new helper function printLiteral64.
DeltaFile
+13-14llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.cpp
+2-0llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.h
+15-142 files

LLVM/project 038e91bllvm/include/llvm/IR ProfDataUtils.h, llvm/lib/IR ProfDataUtils.cpp Verifier.cpp

[WPD] set the function entry count
DeltaFile
+203-0llvm/test/Transforms/WholeProgramDevirt/branch-funnel-profile.ll
+43-10llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
+10-2llvm/lib/IR/ProfDataUtils.cpp
+4-5llvm/lib/IR/Verifier.cpp
+5-1llvm/include/llvm/IR/ProfDataUtils.h
+265-185 files

LLVM/project 8f482d5lld/test/ELF loongarch-call36.s, llvm/lib/Target/LoongArch/MCTargetDesc LoongArchMCTargetDesc.cpp

Revert "[sancov][LoongArch] Resolve pcaddu18i+jirl in evaluateBranch and teach sancov" (#155879)

Reverts llvm/llvm-project#155371

Breaks ubsan bots.
DeltaFile
+0-78llvm/lib/Target/LoongArch/MCTargetDesc/LoongArchMCTargetDesc.cpp
+1-6llvm/tools/sancov/sancov.cpp
+3-3lld/test/ELF/loongarch-call36.s
+4-873 files

LLVM/project d77cf57llvm/test lit.cfg.py, llvm/test/CodeGen/NVPTX clusterlaunchcontrol-multicast.ll f16-instructions.ll

[lit] Refactor available `ptxas` features (#154439)

ToT `lit` currently assumes that a given `ptxas` version supports all
capabilities of prior `ptxas` releases. This approach was flexible
enough to support the removal of 32-bit address compilation from `ptxas`
in CUDA 12.1, but it struggles with the removal of Volta and prior
compilation in CUDA 13.0.

To deal with this, this PR refactors how `lit` defines the set of
features available for a given `ptxas` version. It invokes `ptxas` not
just to get its version, but also to get the list of supported SMs,
supported PTX ISA versions, and support for 32-bit compilation.

This approach should be flexible enough to deal with the changing
support matrix of `ptxas` as it goes forward. One obvious downside is
that this relies on parsing the `stdout` of `ptxas`, something that's
inherently unstable. But, IMO, this is something that we can fix as
needed.
DeltaFile
+125-63llvm/test/lit.cfg.py
+6-6llvm/test/CodeGen/NVPTX/clusterlaunchcontrol-multicast.ll
+4-4llvm/test/CodeGen/NVPTX/f16-instructions.ll
+4-4llvm/test/CodeGen/NVPTX/f16-abs.ll
+3-3llvm/test/CodeGen/NVPTX/atomics-sm70.ll
+3-3llvm/test/CodeGen/NVPTX/fence-nocluster.ll
+145-83168 files not shown
+366-304174 files

LLVM/project d3520a6clang/test/CodeGen/X86 sse2-builtins.c sse42-builtins.c

[X86] Add -fexperimental-new-constant-interpreter test coverage to the MMX/SSE constexpr test files (#155857)

Partial fix for #155814
DeltaFile
+11-0clang/test/CodeGen/X86/sse2-builtins.c
+9-0clang/test/CodeGen/X86/sse42-builtins.c
+9-0clang/test/CodeGen/X86/ssse3-builtins.c
+9-0clang/test/CodeGen/X86/mmx-builtins.c
+9-0clang/test/CodeGen/X86/sse41-builtins.c
+5-0clang/test/CodeGen/X86/sse3-builtins.c
+52-01 files not shown
+57-07 files

LLVM/project 613a124mlir/cmake/modules AddMLIRPython.cmake, mlir/examples/standalone CMakeLists.txt

Update AddMLIRPython.cmake
DeltaFile
+8-5mlir/cmake/modules/AddMLIRPython.cmake
+5-5mlir/examples/standalone/python/CMakeLists.txt
+2-0mlir/examples/standalone/CMakeLists.txt
+15-103 files

LLVM/project 14485acllvm/lib/Target/AMDGPU/MCTargetDesc AMDGPUMCCodeEmitter.cpp

[AMDGPU] Determine MCFixupKind in a more general way. NFCI. (#155864)

DeltaFile
+3-9llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCCodeEmitter.cpp
+3-91 files

LLVM/project f2e6ca8mlir/lib/Dialect/Vector/Transforms VectorDistribute.cpp, mlir/test/Dialect/Vector vector-warp-distribute.mlir

[MLIR][Vector] Add warp distribution for `vector.step` op (#155425)

This PR adds a distribution pattern for
[`vector.step`](https://mlir.llvm.org/docs/Dialects/Vector/#vectorstep-vectorstepop)
op.

The result of the step op is a vector containing a sequence
`[0,1,...,N-1]`. For the warp distribution, we consider a vector with `N
== warp_size` (think SIMD). Distributing it to SIMT, means that each
lane is represented by a thread/lane id scalar.

More complex cases with the support for warp size multiples (e.g.,
`[0,1,...,2*N-1]`) require additional layout information to be handled
properly. Such support may be added later.

The lane id scalar is wrapped into a `vector<1xindex>` to emulate the
sequence distribution result.
Other than that, the distribution is similar to that of
`arith.constant`.
DeltaFile
+47-1mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp
+32-0mlir/test/Dialect/Vector/vector-warp-distribute.mlir
+79-12 files

LLVM/project 492089ecompiler-rt/lib/asan asan_errors.h

Provide ErrorBadParamsToCopyContiguousContainerAnnotations a more cor… (#139870)

…rect 'reason' when constructing ErrorBase

Co-authored-by: Tacet <advenam.tacet at gmail.com>
DeltaFile
+1-1compiler-rt/lib/asan/asan_errors.h
+1-11 files

LLVM/project ca5d195llvm/utils/TableGen DecoderEmitter.cpp

[TableGen][DecoderEmitter] Simplify emitSoftFailTableEntry (NFC) (#155863)

DeltaFile
+11-30llvm/utils/TableGen/DecoderEmitter.cpp
+11-301 files

LLVM/project 21e1ab3flang/include/flang/Evaluate match.h, flang/lib/Semantics check-omp-atomic.cpp

[flang][OpenMP] Reassociate floating-point ATOMIC update expressions (#155840)

This is a follow-up to PR153488, this time the reassociation is enabled
for floating-point expressions, but only when associative-nath is
enabled in the language options. This can be done via -ffast-math on the
command line.
DeltaFile
+100-0flang/test/Lower/OpenMP/atomic-update-reassoc-fp.f90
+27-4flang/lib/Semantics/check-omp-atomic.cpp
+18-3flang/include/flang/Evaluate/match.h
+145-73 files

LLVM/project 4f6032f.ci cache_lit_timing_files.py

[CI] Support using blob prefix for downloading

So that uploading and downloading use the same file paths. Also refactor
everything so that _get_blob_prefix is the common implementation.
DeltaFile
+7-4.ci/cache_lit_timing_files.py
+7-41 files

LLVM/project 78396c8mlir/examples/standalone/test/python smoketest.py

Update smoketest.py
DeltaFile
+3-1mlir/examples/standalone/test/python/smoketest.py
+3-11 files

LLVM/project 0f435e8lld/test/ELF loongarch-call36.s, llvm/lib/Target/LoongArch/MCTargetDesc LoongArchMCTargetDesc.cpp

Revert "[sancov][LoongArch] Resolve pcaddu18i+jirl in evaluateBranch and teac…"

This reverts commit 9c994f54497b54d8c44d2c6979a4cdaa8e4d7833.
DeltaFile
+0-78llvm/lib/Target/LoongArch/MCTargetDesc/LoongArchMCTargetDesc.cpp
+1-6llvm/tools/sancov/sancov.cpp
+3-3lld/test/ELF/loongarch-call36.s
+4-873 files

LLVM/project ddf8cd3clang/test/CodeGenHLSL/resources cbuffer.hlsl, llvm/test/CodeGen/DirectX/ImplicitBinding multiple-spaces.ll

[HLSL][DirectX] Remove uniformity bit from resource initialization intrinsics (#155332)

Removes uniformity bit from resource initialization intrinsics `llvm.{dx|spv}.resource.handlefrombinding` and `llvm.{dx|spv}.resource.handlefromimplicitbinding`. The flag currently always set to `false`. It should be derived from resource analysis and not provided by codegen.

Closes #135452
DeltaFile
+20-20llvm/test/CodeGen/SPIRV/pointers/structured-buffer-access.ll
+15-15llvm/test/CodeGen/DirectX/Metadata/uav_metadata.ll
+12-12llvm/test/CodeGen/DirectX/ImplicitBinding/multiple-spaces.ll
+12-12llvm/test/CodeGen/DirectX/Metadata/srv_metadata.ll
+12-12llvm/test/CodeGen/SPIRV/pointers/resource-vector-load-store.ll
+10-10clang/test/CodeGenHLSL/resources/cbuffer.hlsl
+81-81107 files not shown
+472-476113 files

LLVM/project dcbbda4clang/lib/Sema HLSLBuiltinTypeDeclBuilder.cpp HLSLBuiltinTypeDeclBuilder.h, clang/test/AST/HLSL ByteAddressBuffers-AST.hlsl TypedBuffers-AST.hlsl

[HLSL] Add static methods for resource initialization and a constructor from handle

Adds static methods `__createFromBinding` and `__createFromImplicitBinding`
to resource classes. These methods will be used for resource initialization
instead of the resource constructors that take binding information.

Also adds a private resource constructor that takes an initialized resource handle.
This constructor will be called from the static create methods.
DeltaFile
+150-19clang/lib/Sema/HLSLBuiltinTypeDeclBuilder.cpp
+65-2clang/test/AST/HLSL/ByteAddressBuffers-AST.hlsl
+63-0clang/test/AST/HLSL/TypedBuffers-AST.hlsl
+63-0clang/test/AST/HLSL/StructuredBuffers-AST.hlsl
+8-1clang/lib/Sema/HLSLBuiltinTypeDeclBuilder.h
+4-1clang/lib/Sema/HLSLExternalSemaSource.cpp
+353-231 files not shown
+354-237 files

LLVM/project d1b0071llvm/include/llvm/ADT DenseMapInfo.h

Revert "[ADT] Fix signed integer overflow (#155826)"

This reverts commit a01933d586343b7006bf354dc6557ba65621072e.
DeltaFile
+1-2llvm/include/llvm/ADT/DenseMapInfo.h
+1-21 files

LLVM/project 1084211flang/lib/Semantics check-omp-structure.cpp resolve-names.cpp, flang/test/Parser/OpenMP construct-prefix-conflict.f90

[flang][OpenMP] Replace OpenMPBlockConstruct with OmpBlockConstruct

OpenMPBlockConstruct, somewhat confusingly, represents most but not all
block-associated constructs. It's derived from OmpBlockConstruct, as are
all the remaining block-associated constructs.

It does not correspond to any well-defined group of constructs. It's the
collection of constructs that don't have their own types (and those that
do have their own types do so for their own reasons).

Using the broader OmpBlockConstruct in type-based visitors won't cause
issues, because the specific overloads (for classes derived from it) will
always be preferred.
DeltaFile
+15-15flang/test/Semantics/OpenMP/simd-only.f90
+6-6flang/lib/Semantics/check-omp-structure.cpp
+6-6flang/lib/Semantics/resolve-names.cpp
+5-5flang/lib/Semantics/rewrite-parse-tree.cpp
+5-5flang/test/Parser/OpenMP/construct-prefix-conflict.f90
+4-4flang/lib/Semantics/resolve-directives.cpp
+41-4114 files not shown
+65-7220 files

LLVM/project c8d0ff4flang/examples/FlangOmpReport FlangOmpReportVisitor.cpp

[flang][OpenMP] Simplify getLocation in FlangOmpReportVisitor, NFC

Use a utility function to get the construct source.
DeltaFile
+3-26flang/examples/FlangOmpReport/FlangOmpReportVisitor.cpp
+3-261 files

LLVM/project dff8788.ci cache_lit_timing_files.py

[CI] Prefix lit timing blobs with platform

Windows and Linux might have vastly different test runtimes. We also end
up with significantly fewer files on Windows and do not want to incur
the overhead of unpacking/repacking a bunch of files that we never end
up using.
DeltaFile
+3-1.ci/cache_lit_timing_files.py
+3-11 files

LLVM/project 05337femlir/examples/standalone/test/python smoketest.py

[MLIR][Python] fix StandalonePythonModules shipping all dialect modules
DeltaFile
+7-0mlir/examples/standalone/test/python/smoketest.py
+7-01 files

LLVM/project e5cfb97clang/lib/Sema HLSLBuiltinTypeDeclBuilder.cpp HLSLBuiltinTypeDeclBuilder.h, clang/test/AST/HLSL ByteAddressBuffers-AST.hlsl TypedBuffers-AST.hlsl

[HLSL] Add static methods for resource initialization and a constructor from handle

Adds static methods `__createFromBinding` and `__createFromImplicitBinding`
to resource classes. These methods will be used for resource initialization
instead of the resource constructors that take binding information.

Also adds a private resource constructor that takes an initialized resource handle.
This constructor will be called from the static create methods.
DeltaFile
+149-19clang/lib/Sema/HLSLBuiltinTypeDeclBuilder.cpp
+65-2clang/test/AST/HLSL/ByteAddressBuffers-AST.hlsl
+63-0clang/test/AST/HLSL/TypedBuffers-AST.hlsl
+63-0clang/test/AST/HLSL/StructuredBuffers-AST.hlsl
+8-1clang/lib/Sema/HLSLBuiltinTypeDeclBuilder.h
+4-1clang/lib/Sema/HLSLExternalSemaSource.cpp
+352-231 files not shown
+353-237 files

LLVM/project 111844aclang/test/CodeGen unified-lto-pipeline.c asan-unified-lto.ll

[LTO] Enhance unified/nonunified LTO checks. (#148229)

For the PS targets, unified LTO pipeline is used as default behaviour when enabling LTO. Other targets use Distinct LTO pipeline behaviour as default behavious. Unified/nonunified LTO checks are enhanced in this PR:
1. Check that the default, unified, and non-unified behavior for asan-unified-lto.ll all match (irrespective of what the default unified/nonunified behavior is).
2. Check that the difference in behavior for 'unified' and 'full' runs is (regarding the passes) is what is expected (rather than simply that they are different).
DeltaFile
+4-2clang/test/CodeGen/unified-lto-pipeline.c
+1-0clang/test/CodeGen/asan-unified-lto.ll
+5-22 files

LLVM/project 11c19eamlir/lib/Dialect/Tosa/Transforms TosaValidation.cpp

[MLIR] Apply clang-tidy fixes for llvm-else-after-return in TosaValidation.cpp (NFC)
DeltaFile
+5-3mlir/lib/Dialect/Tosa/Transforms/TosaValidation.cpp
+5-31 files

LLVM/project 4913aeelldb/test/API/lang/cpp/libcxx-internals-recognizer TestLibcxxInternalsRecognizer.py

[lldb] Skip test with older version of clang
DeltaFile
+1-1lldb/test/API/lang/cpp/libcxx-internals-recognizer/TestLibcxxInternalsRecognizer.py
+1-11 files

LLVM/project 7a0d3caclang/lib/CodeGen CGHLSLBuiltins.cpp, clang/lib/Sema SemaHLSL.cpp HLSLBuiltinTypeDeclBuilder.cpp

[HLSL] Reorder the arguments of handle initialization builtins

Reorder the arguments of handle initialization builtins to match the order of the
llvm intrinsics, and also to match the arguments on the static create methods for
resources (coming soon).
DeltaFile
+4-4clang/lib/CodeGen/CGHLSLBuiltins.cpp
+2-2clang/lib/Sema/SemaHLSL.cpp
+1-1clang/test/AST/HLSL/ByteAddressBuffers-AST.hlsl
+1-1clang/lib/Sema/HLSLBuiltinTypeDeclBuilder.cpp
+1-1clang/test/AST/HLSL/StructuredBuffers-AST.hlsl
+1-1clang/test/AST/HLSL/TypedBuffers-AST.hlsl
+10-106 files

LLVM/project f5694f0clang/lib/Headers emmintrin.h avx2intrin.h, clang/test/CodeGen/X86 sse2-builtins.c avx2-builtins.c

[Headers][X86] Update SSE/AVX integer comparison intrinsics to be used in constexpr (#155656)

Closes #155413
DeltaFile
+18-18clang/lib/Headers/emmintrin.h
+8-8clang/lib/Headers/avx2intrin.h
+15-0clang/test/CodeGen/X86/sse2-builtins.c
+14-0clang/test/CodeGen/X86/avx2-builtins.c
+4-4clang/lib/Headers/smmintrin.h
+2-0clang/test/CodeGen/X86/sse42-builtins.c
+61-301 files not shown
+62-307 files

LLVM/project 9a99896llvm/lib/CodeGen TargetLoweringObjectFileImpl.cpp, llvm/test/CodeGen/RISCV dwarf-eh.ll

[TargetLoweringObjectFile] Handle riscv BE (#155166)

Add DWARF exception handling support for riscv big-endian targets.
More CodeGen changes related to riscvbe are coming.
DeltaFile
+16-0llvm/test/CodeGen/RISCV/dwarf-eh.ll
+2-0llvm/lib/CodeGen/TargetLoweringObjectFileImpl.cpp
+18-02 files