LLVM/project f88d060mlir/lib/Dialect/AMDGPU/IR AMDGPUDialect.cpp, mlir/test/Conversion/AMDGPUToROCDL memory_counter_wait_unsupported.mlir memory_counter_wait_tensor.mlir

[mlir][amdgpu] `memory_counter_wait` tensor counter support (#171153)

DeltaFile
+11-0mlir/test/Conversion/AMDGPUToROCDL/memory_counter_wait_unsupported.mlir
+6-4mlir/test/Dialect/AMDGPU/ops.mlir
+9-0mlir/test/Conversion/AMDGPUToROCDL/memory_counter_wait_tensor.mlir
+4-4mlir/test/Conversion/AMDGPUToROCDL/memory_counter_wait.mlir
+5-3mlir/lib/Dialect/AMDGPU/IR/AMDGPUDialect.cpp
+3-3mlir/test/Dialect/AMDGPU/canonicalize.mlir
+38-142 files not shown
+47-168 files

LLVM/project f27fbcalldb/include/lldb/Core Module.h, lldb/include/lldb/Symbol SymbolFile.h

[lldb][NFC] Replace const std::vector& with ArrayRef in APIs (#170834)

Inside the LLVM codebase, const vector& should just be ArrayRef, as this
more general API works both with vectors, SmallVectors and
SmallVectorImpl, as well as with single elements.

This commit replaces two uses introduced in
https://github.com/llvm/llvm-project/pull/168797 .
DeltaFile
+8-17lldb/source/Core/Module.cpp
+1-18lldb/include/lldb/Core/Module.h
+4-4lldb/source/Symbol/SymbolFile.cpp
+3-4lldb/include/lldb/Symbol/SymbolFile.h
+16-434 files

LLVM/project cbce30eclang/lib/Interpreter IncrementalAction.cpp, clang/test/Interpreter pretty-print.c

[clang-repl] Skip CodeGen for top-level decls when diagnostics report errors (#169989)

We can see the following while running clang-repl in C mode 
```
anutosh491 at vv-nuc:/build/anutosh491/llvm-project/build/bin$ ./clang-repl --Xcc=-x --Xcc=c --Xcc=-std=c23
clang-repl> printf("hi\n");
In file included from <<< inputs >>>:1:
input_line_1:1:1: error: call to undeclared library function 'printf' with type 'int (const char *, ...)'; ISO C99 and
      later do not support implicit function declarations [-Wimplicit-function-declaration]
    1 | printf("hi\n");
      | ^
input_line_1:1:1: note: include the header <stdio.h> or explicitly provide a declaration for 'printf'
error: Parsing failed.
clang-repl> #include <stdio.h>
hi
```

In debug mode while dumping the generated Module, i see this 
```

    [65 lines not shown]
DeltaFile
+5-0clang/lib/Interpreter/IncrementalAction.cpp
+3-0clang/test/Interpreter/pretty-print.c
+8-02 files

LLVM/project c7022e3flang/include/flang/Parser openmp-utils.h

Add empty
DeltaFile
+1-0flang/include/flang/Parser/openmp-utils.h
+1-01 files

LLVM/project 1bf0959libc/src/__support/time abs_timeout.h monotonicity.h, libc/src/__support/time/linux abs_timeout.h monotonicity.h

Merge branch 'main' into users/kparzysz/r01-loop-range
DeltaFile
+0-50libc/src/__support/time/linux/abs_timeout.h
+50-0libc/src/__support/time/abs_timeout.h
+0-44libc/src/__support/time/linux/monotonicity.h
+44-0libc/src/__support/time/monotonicity.h
+0-21libc/src/__support/time/linux/CMakeLists.txt
+20-0libc/src/__support/time/CMakeLists.txt
+114-11512 files not shown
+139-13218 files

LLVM/project 58cb487mlir/lib/Pass PassRegistry.cpp

Revert "[mlir] Fix '-Wtemplate-id-cdtor'. NFC"

This reverts commit fccb65ef8f0faf40ca5dfaaa0ef70821f8843821.

It breaks pre-merge CI:
```
2025-12-08T16:35:11.7239054Z /home/gha/actions-runner/_work/llvm-project/llvm-project/mlir/lib/Pass/PassRegistry.cpp:439:37: error: ISO C++ requires the name after '::~' to be found in the same scope as the name before '::~' [-Werror,-Wdtor-name]
2025-12-08T16:35:11.7240458Z   439 | llvm::cl::OptionValue<OpPassManager>::~OptionValue() = default;
2025-12-08T16:35:11.7241014Z       | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~
2025-12-08T16:35:11.7241494Z       |                                     ::OptionValue
2025-12-08T16:35:11.7241903Z 1 error generated.
```
DeltaFile
+1-1mlir/lib/Pass/PassRegistry.cpp
+1-11 files

LLVM/project 2a5420elibc/src/__support/time abs_timeout.h monotonicity.h, libc/src/__support/time/linux abs_timeout.h monotonicity.h

[libc] move abs_timesout and monotonicity out of linux dir (#167719)

This patch moves abs_timeout and monotonicity out of the linux dir into
common. Both of these functions depend on clock_gettime which is the
actual os-dependent component. As other features in `__support/threads`
may want to use these, it's better to share it in common.
DeltaFile
+50-0libc/src/__support/time/abs_timeout.h
+0-50libc/src/__support/time/linux/abs_timeout.h
+44-0libc/src/__support/time/monotonicity.h
+0-44libc/src/__support/time/linux/monotonicity.h
+0-21libc/src/__support/time/linux/CMakeLists.txt
+20-0libc/src/__support/time/CMakeLists.txt
+114-11511 files not shown
+138-13117 files

LLVM/project 9df944fllvm/lib/Target/AArch64 AArch64CollectCPSpillInfo.cpp AArch64TargetMachine.cpp, llvm/test/CodeGen/AArch64 fptosi-sat-vector.ll fptoui-sat-vector.ll

Constant pool spilling
DeltaFile
+503-525llvm/test/CodeGen/AArch64/fptosi-sat-vector.ll
+942-0llvm/lib/Target/AArch64/AArch64CollectCPSpillInfo.cpp
+177-177llvm/test/CodeGen/AArch64/fptoui-sat-vector.ll
+19-44llvm/test/CodeGen/AArch64/arm64-fp128.ll
+11-0llvm/lib/Target/AArch64/AArch64TargetMachine.cpp
+2-7llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-stores.ll
+1,654-7534 files not shown
+1,664-75510 files

LLVM/project ed27a1bclang/lib/CIR/Lowering/DirectToLLVM LowerToLLVM.cpp

[CIR] Clean up visibility conversion (NFC)
DeltaFile
+5-7clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
+5-71 files

LLVM/project 9804f21lldb/source/Commands CommandObjectBreakpoint.cpp, llvm/test/CodeGen/AMDGPU atomicrmw_usub_sat.ll atomicrmw_usub_cond.ll

Merge branch 'main' into users/kparzysz/r01-loop-range
DeltaFile
+4,091-0llvm/test/CodeGen/AMDGPU/atomicrmw_usub_sat.ll
+2,027-185llvm/test/CodeGen/X86/shift-i512.ll
+1,563-413llvm/test/CodeGen/X86/bitcnt-big-integer.ll
+1,276-73lldb/source/Commands/CommandObjectBreakpoint.cpp
+1,315-0llvm/test/CodeGen/AMDGPU/atomicrmw_usub_cond.ll
+0-1,298openmp/runtime/src/include/omp_lib.h.var
+10,272-1,9691,163 files not shown
+51,296-17,4751,169 files

LLVM/project 614fe6dmlir/lib/Conversion/OpenMPToLLVM OpenMPToLLVM.cpp, mlir/test/Conversion/OpenMPToLLVM map-info-type-conversion-fail.mlir

[mlir][OpenMP] Fix crash in MapInfoOp conversion when type conversion fails (#171045)

Check the result of `convertType` before calling `TypeAttr::get`. This
prevents a crash on unsupported types (e.g. `tensor`) by ensuring the
pattern fails gracefully.

Added regression test: map-info-type-conversion-fail.mlir

Fixes: #108159
DeltaFile
+14-0mlir/test/Conversion/OpenMPToLLVM/map-info-type-conversion-fail.mlir
+3-0mlir/lib/Conversion/OpenMPToLLVM/OpenMPToLLVM.cpp
+17-02 files

LLVM/project b32a2f4clang/include/clang/Basic BuiltinsAMDGPU.def, clang/lib/AST ASTContext.cpp

[Clang][OpenCL][AMDGPU] Allow _Float16 and half vector type compatibility (#170605)

## Summary
Allowing implicit compatibility between `_Float16` vector types and
`half` vector types in OpenCL mode. This enables AMDGPU builtins to work
correctly across OpenCL, HIP, and C++ without requiring separate builtin
definitions.
## Problem Statement
When using AMDGPU image builtins that return half-precision vectors in
OpenCL, users encounter type incompatibility errors:
**Builtin Definition:**
`TARGET_BUILTIN(__builtin_amdgcn_image_load_1d_v4f16_i32, "V4xiiQtii",
"nc", "image-insts")`

**Test Case:**
```
typedef half half4 __attribute__((ext_vector_type(4)));
half4 test_builtin_image_load_1d_2(half4 v4f16, int i32, __amdgpu_texture_t tex) {
  return __builtin_amdgcn_image_load_1d_v4f16_i32(100, i32, tex, 120, i32);

    [11 lines not shown]
DeltaFile
+75-0clang/test/SemaOpenCL/half-float16-vector-compatibility.cl
+30-30clang/include/clang/Basic/BuiltinsAMDGPU.def
+36-0clang/test/CodeGen/half-float16-vector-compatibility.cl
+15-0clang/lib/AST/ASTContext.cpp
+8-1clang/lib/Sema/SemaExpr.cpp
+164-315 files

LLVM/project fccb65emlir/lib/Pass PassRegistry.cpp

[mlir] Fix '-Wtemplate-id-cdtor'. NFC
DeltaFile
+1-1mlir/lib/Pass/PassRegistry.cpp
+1-11 files

LLVM/project 1ae6217clang/lib/CIR/Lowering/DirectToLLVM LowerToLLVM.cpp

[CIR] Clean up visibility conversion (NFC)
DeltaFile
+5-7clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
+5-71 files

LLVM/project a05fc9ellvm/lib/Target/Hexagon HexagonGenWideningVecInstr.cpp

HexagonGenWideningVecInstr.cpp - fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFC. (#171095)

DeltaFile
+1-1llvm/lib/Target/Hexagon/HexagonGenWideningVecInstr.cpp
+1-11 files

LLVM/project d1ad085llvm/lib/Target/PowerPC PPCMIPeephole.cpp, llvm/test/CodeGen/PowerPC splat-after-xxsldwi.ll

Fix [PowerPC] llc crashed at -O1/O2/O3: Assertion `isImm() && "Wrong MachineOperand mutator"' failed. (#170548)

Fixed issue 
[[PowerPC] llc crashed at -O1/O2/O3: Assertion `isImm() && "Wrong
MachineOperand mutator"'
failed.](https://github.com/llvm/llvm-project/issues/167672)

the root cause of the crash, the IMM operand is in different operand num
of the instruction PPC::XXSPLTW and PPC::XXSPLTB/PPC::XXSPLTH.

and the patch also fix a potential bug that the new element index of
PPC::XXSPLTB/PPC::XXSPLTH/XXSPLTW use the same logic. It should be
different .We need to convert the element index into the proper unit
(byte for VSPLTB, halfword for VSPLTH, word for VSPLTW) because
PPC::XXSLDWI interprets its ShiftImm in 32-bit word units.
DeltaFile
+37-9llvm/lib/Target/PowerPC/PPCMIPeephole.cpp
+22-0llvm/test/CodeGen/PowerPC/splat-after-xxsldwi.ll
+59-92 files

LLVM/project 447af32mlir/lib/Conversion/XeGPUToXeVM XeGPUToXeVM.cpp, mlir/test/Conversion/XeGPUToXeVM loadstore_nd.mlir create_nd_tdesc.mlir

[MLIR][XeGPU][XeVM] create_nd_tdesc: use correct pitch from strides. (#170384)

Base memory pitch should be derived from base stride, not base width.
Remove offset fields from tensor descriptor payload and add pitch field.
DeltaFile
+7-53mlir/test/Conversion/XeGPUToXeVM/loadstore_nd.mlir
+24-25mlir/lib/Conversion/XeGPUToXeVM/XeGPUToXeVM.cpp
+16-21mlir/test/Conversion/XeGPUToXeVM/create_nd_tdesc.mlir
+5-16mlir/test/Conversion/XeGPUToXeVM/prefetch_nd.mlir
+52-1154 files

LLVM/project b8ddbc4mlir/test/Integration/Dialect/XeVM/GPU gpu_printf.mlir

[MLIR][XeVM] gpu.printf test: use correct runtime. (#170754)

gpu printf test was not using the runtime required by lit.local.cfg
All other tests in the directory are correctly using level zero runtime.
But gpu printf test is using sycl runtime.
DeltaFile
+1-1mlir/test/Integration/Dialect/XeVM/GPU/gpu_printf.mlir
+1-11 files

LLVM/project 4e8a185llvm/unittests/ADT BitVectorTest.cpp

Apply suggestions from code review

Co-authored-by: Jakub Kuderski <jakub at nod-labs.com>
DeltaFile
+0-3llvm/unittests/ADT/BitVectorTest.cpp
+0-31 files

LLVM/project 4846cd2llvm/lib/Target/AMDGPU SIISelLowering.cpp, llvm/test/CodeGen/AMDGPU inline-asm.ll inlineasm-mismatched-size-error.ll

review comments
DeltaFile
+0-12llvm/test/CodeGen/AMDGPU/inline-asm.ll
+6-0llvm/test/CodeGen/AMDGPU/inlineasm-mismatched-size-error.ll
+1-4llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+7-163 files

LLVM/project e65d0e7llvm/lib/Target/AMDGPU SIISelLowering.cpp, llvm/test/CodeGen/AMDGPU inlineasm-mismatched-size-error.ll

add more tests
DeltaFile
+18-0llvm/test/CodeGen/AMDGPU/inlineasm-mismatched-size-error.ll
+2-0llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+20-02 files

LLVM/project 88a9db4llvm/test/CodeGen/AMDGPU inline-asm-use-bool.ll inline-asm.ll

move test
DeltaFile
+0-15llvm/test/CodeGen/AMDGPU/inline-asm-use-bool.ll
+12-0llvm/test/CodeGen/AMDGPU/inline-asm.ll
+12-152 files

LLVM/project 4044fc5llvm/lib/Target/AMDGPU SIISelLowering.cpp, llvm/test/CodeGen/AMDGPU inline-asm-use-bool.ll

[AMDGPU] Fix a crash when a bool variable is used in inline asm
DeltaFile
+15-0llvm/test/CodeGen/AMDGPU/inline-asm-use-bool.ll
+5-0llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+20-02 files

LLVM/project 5e47501clang/lib/CIR/Lowering/DirectToLLVM LowerToLLVM.cpp

[CIR] Clean up visibility conversion (NFC)
DeltaFile
+5-7clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
+5-71 files

LLVM/project ca8419dmlir/include/mlir/Dialect/AMDGPU/IR AMDGPU.td, mlir/lib/Dialect/AMDGPU/IR AMDGPUDialect.cpp

[mlir][amdgpu] Fuse adjacent `MemoryCounterWaitOp` (#171148)

Taking the minimum value.
DeltaFile
+45-0mlir/lib/Dialect/AMDGPU/IR/AMDGPUDialect.cpp
+36-0mlir/test/Dialect/AMDGPU/canonicalize.mlir
+2-0mlir/include/mlir/Dialect/AMDGPU/IR/AMDGPU.td
+83-03 files

LLVM/project 0a9455aclang/lib/CIR/Lowering/DirectToLLVM LowerToLLVM.cpp

[CIR] Clean up visibility conversion (NFC)
DeltaFile
+5-7clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
+5-71 files

LLVM/project ebdb903llvm/lib/Target/X86 X86ISelLowering.cpp, llvm/test/CodeGen/X86 vector-shuffle-combining-avx512f.ll compress-undef-float-passthrough.ll

[X86] Handle X86ISD::EXPAND/COMPRESS nodes as target shuffles (#171119)

Allows for shuffle simplification

Required a minor fix to the overly reduced compress-undef-float-passthrough.ll regression test
DeltaFile
+11-56llvm/test/CodeGen/X86/vector-shuffle-combining-avx512f.ll
+46-0llvm/lib/Target/X86/X86ISelLowering.cpp
+2-2llvm/test/CodeGen/X86/compress-undef-float-passthrough.ll
+59-583 files

LLVM/project b08c72bflang/lib/Parser openmp-parsers.cpp unparse.cpp, flang/test/Lower/OpenMP/Todo threadset.f90

[Flang][OpenMP] Enables parsing of threadset clause (#169856)

DeltaFile
+79-0flang/test/Parser/OpenMP/threadset-clause.f90
+10-0flang/test/Lower/OpenMP/Todo/threadset.f90
+9-0flang/test/Semantics/OpenMP/threadset-clause.f90
+7-1flang/lib/Parser/openmp-parsers.cpp
+1-0flang/lib/Parser/unparse.cpp
+106-15 files

LLVM/project c5b9010llvm/lib/Transforms/Vectorize VPlanTransforms.cpp, llvm/test/Transforms/LoopVectorize/AArch64 pr60831-sve-inv-store-crash.ll sve-vector-reverse.ll

[VPlan] Use nuw when computing {VF,VScale}xUF (#170710)

These quantities should never unsigned-wrap. This matches the behavior
if only VFxUF is used (and not VF): when computing both VF and VFxUF,
nuw should hold for each step separately.
DeltaFile
+12-12llvm/test/Transforms/LoopVectorize/RISCV/riscv-vector-reverse.ll
+7-7llvm/test/Transforms/LoopVectorize/RISCV/strided-accesses.ll
+3-3llvm/test/Transforms/LoopVectorize/AArch64/pr60831-sve-inv-store-crash.ll
+4-2llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
+2-2llvm/test/Transforms/LoopVectorize/AArch64/sve-vector-reverse.ll
+2-2llvm/test/Transforms/LoopVectorize/AArch64/sve-inductions-unusual-types.ll
+30-284 files not shown
+35-3310 files

LLVM/project fb3eff2bolt/include/bolt/Passes LivenessAnalysis.h, bolt/lib/Passes ShrinkWrapping.cpp TailDuplication.cpp

[ADT] Make use of subsetOf and anyCommon methods of BitVector (NFC)

Replace the code along these lines

    BitVector Tmp = LHS;
    Tmp &= RHS;
    return Tmp.any();

and

    BitVector Tmp = LHS;
    Tmp.reset(RHS);
    return Tmp.none();

with `LHS.anyCommon(RHS)` and `LHS.subsetOf(RHS)`, correspondingly, which
do not require creating temporary BitVector and can return early.
DeltaFile
+4-6bolt/lib/Passes/ShrinkWrapping.cpp
+4-4bolt/lib/Passes/TailDuplication.cpp
+2-6bolt/lib/Passes/RegReAssign.cpp
+2-4llvm/tools/llvm-exegesis/lib/SnippetGenerator.cpp
+2-4llvm/lib/CodeGen/RDFRegisters.cpp
+2-3bolt/include/bolt/Passes/LivenessAnalysis.h
+16-272 files not shown
+18-318 files