LLVM/project 6b04214flang/include/flang/Semantics openmp-utils.h, flang/lib/Semantics check-omp-loop.cpp

[flang][OpenMP] Import ExecutionPartIterator et al into semantics::om… (#185288)

…p, NFC

Issue: https://github.com/llvm/llvm-project/issues/185287
DeltaFile
+13-0flang/include/flang/Semantics/openmp-utils.h
+0-3flang/lib/Semantics/check-omp-loop.cpp
+13-32 files

LLVM/project f46a6c6clang/include/clang/AST Decl.h, clang/lib/AST Decl.cpp DeclCXX.cpp

[Clang][NFC] Use const ASTContext reference in Decl Create methods

Update Create() static factory methods, CreateDeserialized() methods, and
constructors in Decl.h to accept const ASTContext& instead of ASTContext&.
This change makes ASTContext parameters const-correct for declaration
creation and deserialization, affecting all Decl subclasses declared in
Decl.h.

Exceptions kept as non-const (only 3 methods):
- TranslationUnitDecl::Create() and constructor: stores non-const
  ASTContext& member that is returned by getASTContext()
- DefaultedOrDeletedFunctionInfo::Create(): calls Context.Allocate()
  which requires non-const access

Assisted-By: Claude Sonnect 4.5
DeltaFile
+92-80clang/include/clang/AST/Decl.h
+82-71clang/lib/AST/Decl.cpp
+3-3clang/lib/AST/DeclCXX.cpp
+177-1543 files

LLVM/project ff232d6clang-tools-extra/test/clang-tidy/checkers/Inputs/Headers utility, clang-tools-extra/test/clang-tidy/checkers/cppcoreguidelines pro-bounds-constant-array-index-includestyle.cpp

[clang-tidy][NFC] Add missing Option tests in cppcoreguidelines and performance [3/N] (#185210)

This PR adds testcases for untested Options in `cppcoreguidelines` and
`performance` modules for better test coverage, specifically:

- `cppcoreguidelines-init-variables`: `IncludeStyle`, `MathHeader`.
- `cppcoreguidelines-pro-bounds-constant-array-index`: `IncludeStyle`.
- `performance-inefficient-string-concatenation`: `StrictMode`.
- `performance-no-automatic-move`: `AllowedTypes`.
- `performance-type-promotion-in-math-fn`: `IncludeStyle`.
- `performance-unnecessary-value-param`: `IncludeStyle`.

As of AI Usage: Assisted by Gemini 3 and Claude (Writing part of the
testcases and pre-commit reviewing).
DeltaFile
+40-0clang-tools-extra/test/clang-tidy/checkers/performance/no-automatic-move-allowed-types.cpp
+28-0clang-tools-extra/test/clang-tidy/checkers/cppcoreguidelines/pro-bounds-constant-array-index-includestyle.cpp
+2-25clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-value-param.cpp
+22-0clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-value-param-includestyle.cpp
+18-0clang-tools-extra/test/clang-tidy/checkers/Inputs/Headers/utility
+18-0clang-tools-extra/test/clang-tidy/checkers/performance/inefficient-string-concatenation-strict.cpp
+128-252 files not shown
+154-258 files

LLVM/project 9e883b4clang/test/Lexer cxx20-module-directive.cpp

[clang][test] Add missing FileCheck pipe in cxx20-module-directive.cpp (#185315)

The test had CHECK directives that were never executed because the RUN
line did not pipe output to FileCheck.
DeltaFile
+1-1clang/test/Lexer/cxx20-module-directive.cpp
+1-11 files

LLVM/project 6aa7538llvm/lib/Target/AArch64 AArch64InstrInfo.td AArch64InstrFormats.td, llvm/test/CodeGen/AArch64 sve-streaming-mode-fixed-length-int-to-fp.ll sve-streaming-mode-cvt-fp-int-fp.ll

[LLVM][AArch64] Allow vector converts  to run in streaming mode with … (#177375)

…FPRCVT

Vector Saturated converts fp->int and converts int->fp are now allowed
to run in streaming mode when FEAT_FPRCVT feature is available.

Therefore the patch replaces HasNEONandIsSME2p2StreamingSafe by
HasNEONandIsFPRCVTStreamingSafe following the latest update in [1] for
the Vector CVT instructions.

It also allows the ISD Node FP_TO_SINT_SAT to do custom lowering instead
of expand when it is a fixed lengh data type, because it can use the
Vector CVT instructions. I believe this is correct because there is
always a compatible CVT instruction for the SME core.

Because now the compiler allows the fixed length CVT to run in streaming
mode, I needed to fix the function LowerVectorFP_TO_INT_SAT, the FRINT
instruction were creating a ilegal size for CVT instructions that was

    [4 lines not shown]
DeltaFile
+507-0llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-to-fp.ll
+38-37llvm/test/CodeGen/AArch64/sve-streaming-mode-cvt-fp-int-fp.ll
+48-20llvm/lib/Target/AArch64/AArch64InstrInfo.td
+37-19llvm/lib/Target/AArch64/AArch64InstrFormats.td
+630-764 files

LLVM/project 3833551mlir/lib/Conversion/GPUCommon GPUToLLVMConversion.cpp, mlir/test/Conversion/GPUCommon lower-launch-func-bare-ptr-invalid.mlir

[mlir][gpu] Fix crash in gpu-to-llvm with unranked memref and bare-ptr calling convention (#185062)

When using `--gpu-to-llvm` with `use-bare-pointers-for-kernels=true` and
a `gpu.launch_func` whose kernel has an `UnrankedMemRefType` argument,
`LLVMTypeConverter::promoteOperands` would hit an `llvm_unreachable`
because unranked memrefs are not supported with the bare-pointer calling
convention.

Fix this by checking for unranked memref kernel arguments before calling
`promoteOperands` and returning a proper conversion failure with a
diagnostic instead of crashing.

Fixes #184939

Assisted-by: Claude Code
DeltaFile
+23-0mlir/test/Conversion/GPUCommon/lower-launch-func-bare-ptr-invalid.mlir
+10-0mlir/lib/Conversion/GPUCommon/GPUToLLVMConversion.cpp
+33-02 files

LLVM/project 36de257llvm/lib/Transforms/InstCombine InstCombineCalls.cpp, llvm/test/Transforms/InstCombine get_active_lane_mask.ll

[InstCombine] Handle fixed-width results in get_active_lane_mask fold (#185317)

The optimization introduced in #183329 incorrectly assumed that any
extraction from a scalable active lane mask used a scalable index. When
the result of a `llvm.vector.extract` is a fixed-width vector, the index
should not be multiplied by vscale.

This PR adds a check to ensure the index is only scaled by VScaleMin
when the return type of the extraction is a scalable vector, not
fixed-width.

Fixes #185271
DeltaFile
+13-0llvm/test/Transforms/InstCombine/get_active_lane_mask.ll
+3-1llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
+16-12 files

LLVM/project c54cb1bmlir/lib/Dialect/Linalg/IR LinalgOps.cpp, mlir/test/Dialect/Linalg named-ops-fail.mlir named-ops.mlir

[mlir][linalg] Emit proper diagnostic instead of crashing in SelectOp with index type (#183652)

`buildTernaryFn` for `TernaryFn::select` called `llvm_unreachable` when
the operand types were not `i1`, integer, or floating-point (e.g.,
`index` type).
Instead delegate this checking to the IR verifier: we don't need to
duplicate the checks from the verifier in an assertion here.

Fixes #179046

Assisted-by: Claude Code
DeltaFile
+33-0mlir/test/Dialect/Linalg/named-ops-fail.mlir
+14-0mlir/test/Dialect/Linalg/named-ops.mlir
+0-7mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
+47-73 files

LLVM/project 9f3d9f4clang/lib/AST/ByteCode Compiler.cpp

[clang][bytecode][NFC] Name all expressions E (#185379)

At least the ones we visit directly via the visitor. This was always the
case, except for BinaryOperator and CastExpr.
DeltaFile
+213-216clang/lib/AST/ByteCode/Compiler.cpp
+213-2161 files

LLVM/project 36456b4clang/include/clang/APINotes Types.h

revise comment
DeltaFile
+3-3clang/include/clang/APINotes/Types.h
+3-31 files

LLVM/project 56fe909llvm/examples/Kaleidoscope/Chapter3 toy.cpp

kaleidoscope: add missing check in the FunctionAST::codegen  (#76322)

kaleidoscope chapter 03 explanation has this function redefine check,
but it was missing in the code sample.

Signed-off-by: amila <amila.15 at cse.mrt.ac.lk>
DeltaFile
+3-0llvm/examples/Kaleidoscope/Chapter3/toy.cpp
+3-01 files

LLVM/project 4994bcellvm/include/llvm/ExecutionEngine/Orc WaitingOnGraph.h

Fix MSVC "not all control paths return a value" warning. NFC. (#185388)
DeltaFile
+1-0llvm/include/llvm/ExecutionEngine/Orc/WaitingOnGraph.h
+1-01 files

LLVM/project 57edc20llvm/lib/Target/NVPTX/MCTargetDesc NVPTXMCAsmInfo.cpp

[NVPTX][NFC] Fix TODO comment style in NVPTXMCAsmInfo.cpp (#185332)

Replace `@TODO` with `TODO` to follow LLVM comment conventions.

This is an NFC change — no functional impact.
DeltaFile
+1-1llvm/lib/Target/NVPTX/MCTargetDesc/NVPTXMCAsmInfo.cpp
+1-11 files

LLVM/project 47e3108clang/lib/CodeGen/TargetBuiltins AMDGPU.cpp, clang/test/CodeGen amdgpu-abi-version.c

clang/AMDGPU: Fix workgroup size builtins for nonuniform work group sizes (#185098)

These were assuming uniform work group sizes. Emit the v4 and v5
sequences to take the remainder group for the nonuniform case.

Currently the device libs uses this builtin on the legacy ABI path with
the same sequence to calculate the remainder, and fully implements the
v5 path. If you perform a franken-build of the library with the updated
builtin, the result is worse. The duplicate sequence does not fully fold out.
However, it does not appear to be wrong. The relevant conformance tests still
pass.
DeltaFile
+627-0clang/test/CodeGenOpenCL/builtins-amdgcn-workgroup-size.cl
+123-36clang/lib/CodeGen/TargetBuiltins/AMDGPU.cpp
+100-16clang/test/CodeGenCUDA/amdgpu-workgroup-size.cu
+60-30clang/test/Headers/gpuintrin.c
+35-15clang/test/CodeGen/amdgpu-abi-version.c
+0-19clang/test/CodeGenOpenCL/builtins-amdgcn.cl
+945-1166 files

LLVM/project 3e3c3abllvm/tools/llvm-gpu-loader llvm-gpu-loader.cpp

[llvm][tools] Improve llvm-gpu-loader checks (#184791)

When devices are not properly initialized llvm-gpu-loader follows corrupt pointers which
result in hard to debug crashes.

This improves the checks to avoid such situations.
DeltaFile
+6-2llvm/tools/llvm-gpu-loader/llvm-gpu-loader.cpp
+6-21 files

LLVM/project 83732a2clang/include/clang/Analysis/Scalable SSAFBuiltinForceLinker.h SSAFForceLinker.h, clang/include/clang/Analysis/Scalable/Frontend TUSummaryExtractorFrontendAction.h

Revert "[clang][ssaf] Add --ssaf-extract-summaries= and --ssaf-tu-summary-file= options" (#185391)

Reverts llvm/llvm-project#184421

Broke build bot:

https://lab.llvm.org/buildbot/#/builders/204/builds/38459/steps/7/logs/stdio

```
[7440/8486] Linking CXX shared library lib/libclangAnalysisScalable.so.23.0git
FAILED: lib/libclangAnalysisScalable.so.23.0git 
: && /usr/bin/c++ -fPIC -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wno-missing-field-initializers -pedantic -Wno-long-long -Wimplicit-fallthrough -Wno-uninitialized -Wno-nonnull -Wno-class-memaccess -Wno-array-bounds -Wno-noexcept-type -Wdelete-non-virtual-dtor -Wno-comment -Wno-misleading-indentation -fdiagnostics-color -ffunction-sections -fdata-sections -fno-common -Woverloaded-virtual -O3 -DNDEBUG  -Wl,-z,defs -Wl,-z,nodelete   -Wl,-rpath-link,/home/botworker/bbot/amdgpu-offload-rhel-8-cmake-build-only/build/./lib  -Wl,--gc-sections -shared -Wl,-soname,libclangAnalysisScalable.so.23.0git -o lib/libclangAnalysisScalable.so.23.0git tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/ASTEntityMapping.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/EntityLinker/EntityLinker.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Frontend/TUSummaryExtractorFrontendAction.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Model/BuildNamespace.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Model/EntityId.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Model/EntityIdTable.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Model/EntityLinkage.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Model/EntityName.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Model/SummaryName.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Serialization/JSONFormat/JSONFormatImpl.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Serialization/JSONFormat/LUSummary.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Serialization/JSONFormat/LUSummaryEncoding.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Serialization/JSONFormat/TUSummary.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Serialization/JSONFormat/TUSummaryEncoding.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Serialization/SerializationFormatRegistry.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Support/ErrorBuilder.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/TUSummary/ExtractorRegistry.cpp.o tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/TUSummary/TUSummaryBuilder.cpp.o  -Wl,-rpath,"\$ORIGIN/../lib:/home/botworker/bbot/amdgpu-offload-rhel-8-cmake-build-only/build/lib:"  lib/libclangIndex.so.23.0git  lib/libclangFrontend.so.23.0git  lib/libclangASTMatchers.so.23.0git  lib/libclangAST.so.23.0git  lib/libclangLex.so.23.0git  lib/libclangBasic.so.23.0git  lib/libLLVMSupport.so.23.0git  -Wl,-rpath-link,/home/botworker/bbot/amdgpu-offload-rhel-8-cmake-build-only/build/lib && :
tools/clang/lib/Analysis/Scalable/CMakeFiles/obj.clangAnalysisScalable.dir/Frontend/TUSummaryExtractorFrontendAction.cpp.o:(.data.rel.ro._ZTVN12_GLOBAL__N_115TUSummaryRunnerE+0xc0): undefined reference to `clang::SemaConsumer::anchor()'
collect2: error: ld returned 1 exit status
```
DeltaFile
+0-362clang/unittests/Analysis/Scalable/Frontend/TUSummaryExtractorFrontendActionTest.cpp
+0-181clang/lib/Analysis/Scalable/Frontend/TUSummaryExtractorFrontendAction.cpp
+0-51clang/unittests/Analysis/Scalable/SSAFBuiltinTestForceLinker.h
+0-33clang/include/clang/Analysis/Scalable/Frontend/TUSummaryExtractorFrontendAction.h
+0-28clang/include/clang/Analysis/Scalable/SSAFBuiltinForceLinker.h
+0-25clang/include/clang/Analysis/Scalable/SSAFForceLinker.h
+0-68023 files not shown
+10-85929 files

LLVM/project 1e7300flldb/source/Plugins/Process/gdb-remote GDBRemoteCommunicationServerCommon.cpp

[PPC64][Linux] Watchpoint configuration for PPC64 (#185192)

On PPC64, SIGTRAP is delivered before the triggering instruction
completes. The previous implementation wrongly assumed that the
instruction gets executed and due to that qHostInfo has the wrong info,
causing the SingleStep being avoided on Linux for PPC64, and the
execution being stuck at the store instruction.
This patch corrects this, making watchpoint work.
DeltaFile
+1-1lldb/source/Plugins/Process/gdb-remote/GDBRemoteCommunicationServerCommon.cpp
+1-11 files

LLVM/project 5d52d2fclang/include/clang/Analysis/Scalable SSAFBuiltinForceLinker.h SSAFForceLinker.h, clang/include/clang/Analysis/Scalable/Frontend TUSummaryExtractorFrontendAction.h

[clang][ssaf] Add --ssaf-extract-summaries= and --ssaf-tu-summary-file= options (#184421)

Along the way, it also fixes the static registration of the builtin
(upstream) formats and extractors.
DeltaFile
+362-0clang/unittests/Analysis/Scalable/Frontend/TUSummaryExtractorFrontendActionTest.cpp
+181-0clang/lib/Analysis/Scalable/Frontend/TUSummaryExtractorFrontendAction.cpp
+51-0clang/unittests/Analysis/Scalable/SSAFBuiltinTestForceLinker.h
+33-0clang/include/clang/Analysis/Scalable/Frontend/TUSummaryExtractorFrontendAction.h
+28-0clang/include/clang/Analysis/Scalable/SSAFBuiltinForceLinker.h
+25-0clang/include/clang/Analysis/Scalable/SSAFForceLinker.h
+680-023 files not shown
+859-1029 files

LLVM/project c73bd3elibclc CMakeLists.txt

libclc: Remove old mesa amdgcn targets (#185385)

amdgcn-- was probably dead when clover was being maintained, since
it switched to using amdgcn-mesa-mesa3d. Also remove amdgcn-mesa-mesa3d,
since clover is no longer in mesa.
DeltaFile
+0-2libclc/CMakeLists.txt
+0-21 files

LLVM/project 0be62beflang/test/Lower/Intrinsics date_and_time.f90 dim.f90

[flang][NFC] Converted five tests from old lowering to new lowering (part 27) (#185340)

Tests converted from test/Lower/Intrinsics: date_and_time.f90,
dconjg.f90, dim.f90, dimag.f90, dprod.f90
DeltaFile
+39-27flang/test/Lower/Intrinsics/date_and_time.f90
+17-11flang/test/Lower/Intrinsics/dim.f90
+9-5flang/test/Lower/Intrinsics/dprod.f90
+6-3flang/test/Lower/Intrinsics/dconjg.f90
+6-3flang/test/Lower/Intrinsics/dimag.f90
+77-495 files

LLVM/project 0da00c3llvm/test/Transforms/LoopVectorize select-cmp.ll iv-select-cmp.ll, llvm/test/Transforms/LoopVectorize/AArch64 conditional-scalar-assignment.ll sve-select-cmp.ll

[LV] Support float and pointer FindLast reductions  (#184101)

This duplicates #182313 with some very small modifications on top, as
@dheaton-arm is unable
to finish the PR and I'm unable to push to his branch.

Expands support for the `FindLast` Reccurence Kind to floating-point and
pointer types, thereby
enabling conditional scalar assignment (CSA) for these types.

Originally authored by @dheaton-arm

---------

Co-authored-by: Damian Heaton <Damian.Heaton at arm.com>
DeltaFile
+210-26llvm/test/Transforms/LoopVectorize/select-cmp.ll
+137-38llvm/test/Transforms/LoopVectorize/AArch64/conditional-scalar-assignment.ll
+143-28llvm/test/Transforms/LoopVectorize/iv-select-cmp.ll
+121-30llvm/test/Transforms/LoopVectorize/X86/conditional-scalar-assignment.ll
+38-11llvm/test/Transforms/LoopVectorize/RISCV/select-cmp-reduction.ll
+13-2llvm/test/Transforms/LoopVectorize/AArch64/sve-select-cmp.ll
+662-1352 files not shown
+672-1378 files

LLVM/project 6bf37b0utils/bazel/llvm-project-overlay/llvm BUILD.bazel

Fix bazel build after #168421 (#185387)
DeltaFile
+4-0utils/bazel/llvm-project-overlay/llvm/BUILD.bazel
+4-01 files

LLVM/project 38d4509llvm/lib/Target/AMDGPU AMDGPULibCalls.cpp, llvm/test/CodeGen/AMDGPU amdgpu-simplify-libcall-rootn-codegen.ll amdgpu-simplify-libcall-rootn.ll

AMDGPU: Fix selection failure on fast vector rootn (#185035)

This was emitting the raw rcp intrinsic, which will fail for any
vector type. This is an afn context anyway, so just emit fdiv
which will select to rcp but also will undergo type legalization.
DeltaFile
+145-0llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-rootn-codegen.ll
+40-44llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-rootn.ll
+4-1llvm/lib/Target/AMDGPU/AMDGPULibCalls.cpp
+189-453 files

LLVM/project d840396clang/include/clang/Options Options.td, clang/lib/CodeGen CGCall.cpp

clang: Simplify emission of uniform-work-group-size attribute (#185066)
DeltaFile
+28-25clang/test/CodeGenCUDA/convergent.cu
+6-22clang/lib/CodeGen/CGCall.cpp
+9-9clang/test/CodeGenOpenCL/convergent.cl
+3-3clang/test/CodeGenHIP/default-attributes.hip
+2-2clang/test/CodeGenHIP/hip_weak_alias.cpp
+1-1clang/include/clang/Options/Options.td
+49-622 files not shown
+51-648 files

LLVM/project 0d32195lldb/unittests/Platform PlatformDarwinTest.cpp

[lldb][test][NFC] PlatformDarwinTest.cpp: fix directory creation comment

To be consistent with the other comments in the same test.
DeltaFile
+2-2lldb/unittests/Platform/PlatformDarwinTest.cpp
+2-21 files

LLVM/project a1866b8mlir/include/mlir/Dialect/LLVMIR LLVMAttrDefs.td, mlir/lib/Target/LLVMIR DebugImporter.cpp

[mlir][llvm] Make name optional for DIBasicTypeAttr and DIStringTypeAttr (#185284)

The name field of DICompositeTypeAttr was already optional, but
DIBasicTypeAttr and DIStringTypeAttr were not handled consistently. Make
the name parameter of both an OptionalParameter to support LLVM debug
info nodes with no name. Update DebugImporter to use getStringAttrOrNull
when translating the name of these types. Add tests for the null-name
cases in the import and export test files.
DeltaFile
+17-20mlir/test/Target/LLVMIR/Import/debug-info.ll
+9-6mlir/test/Target/LLVMIR/llvmir-debug.mlir
+2-2mlir/include/mlir/Dialect/LLVMIR/LLVMAttrDefs.td
+2-1mlir/lib/Target/LLVMIR/DebugImporter.cpp
+30-294 files

LLVM/project 1c1163ellvm/lib/Target/AMDGPU AMDGPULibCalls.cpp, llvm/test/CodeGen/AMDGPU amdgpu-simplify-libcall-rootn-codegen.ll amdgpu-simplify-libcall-rootn.ll

AMDGPU: Fix selection failure on fast vector rootn

This was emitting the raw rcp intrinsic, which will fail for any
vector type. This is an afn context anyway, so just emit fdiv
which will select to rcp but also will undergo type legalization.
DeltaFile
+145-0llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-rootn-codegen.ll
+40-44llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-rootn.ll
+4-1llvm/lib/Target/AMDGPU/AMDGPULibCalls.cpp
+189-453 files

LLVM/project bdde262libclc/clc/include/clc/collective clc_work_group_reduce.h clc_work_group_reduce.inc, libclc/clc/lib/generic/collective clc_work_group_reduce.inc clc_work_group_reduce.cl

libclc: Add work_group_reduce_* functions (#185368)
DeltaFile
+86-0libclc/clc/lib/generic/collective/clc_work_group_reduce.inc
+30-0libclc/clc/lib/generic/collective/clc_work_group_reduce.cl
+23-0libclc/opencl/lib/generic/collective/work_group_reduce.inc
+20-0libclc/clc/include/clc/collective/clc_work_group_reduce.h
+17-0libclc/clc/include/clc/collective/clc_work_group_reduce.inc
+15-0libclc/opencl/lib/generic/collective/work_group_reduce.cl
+191-02 files not shown
+193-08 files

LLVM/project fef02d4llvm/lib/Target/AMDGPU VOPInstructions.td VOP2Instructions.td, llvm/test/MC/Disassembler/AMDGPU gfx908_mai.txt gfx12_dasm_vop3_from_vop1_dpp16.txt

[AMDGPU] Ignore unused VALU src0/1/2 fields when disassembling (#175182)

This enables a future patch to change the default encoding of these
fields, which are mostly ignored by hardware.

(cherry picked from commit 8099e127cde22a5989515ce940eaa2a0a944d95e)
DeltaFile
+41-41llvm/lib/Target/AMDGPU/VOPInstructions.td
+12-12llvm/lib/Target/AMDGPU/VOP2Instructions.td
+5-5llvm/lib/Target/AMDGPU/VOPCInstructions.td
+8-0llvm/test/MC/Disassembler/AMDGPU/gfx908_mai.txt
+3-3llvm/lib/Target/AMDGPU/VOP3PInstructions.td
+4-0llvm/test/MC/Disassembler/AMDGPU/gfx12_dasm_vop3_from_vop1_dpp16.txt
+73-6112 files not shown
+110-6218 files

LLVM/project 05781f4mlir/lib/Dialect/Bufferization/Extensions ShardingExtensions.cpp, mlir/lib/Dialect/Shard/Transforms ShardingPropagation.cpp

[mlir][shard] Small fixes to partition pass (#185050)

- Empty functions (with no blocks) should are skipped by partition pass,
  blocks with more than one continue to get error-flagged
- fixed ShardingInterfaceImpl of bufferization.materialize_in_destination
DeltaFile
+11-13mlir/lib/Dialect/Bufferization/Extensions/ShardingExtensions.cpp
+10-6mlir/lib/Dialect/Shard/Transforms/ShardingPropagation.cpp
+8-1mlir/test/Dialect/Shard/sharding-propagation-failed.mlir
+29-203 files