LLVM/project 2cf2768lldb/test/API/functionalities/scripted_frame_provider/circular_dependency TestFrameProviderCircularDependency.py

[lldb/test] XFAIL TestFrameProviderCircularDependency.py on Windows

This patch disables TestFrameProviderCircularDependency.py on Windows
since the scripted frame provider uses SBTarget.FindFunctions which
doesn't seem to be working (according to TestTargetAPI.test_find_functions).

Signed-off-by: Med Ismail Bennani <ismail at bennani.ma>
DeltaFile
+1-0lldb/test/API/functionalities/scripted_frame_provider/circular_dependency/TestFrameProviderCircularDependency.py
+1-01 files

LLVM/project 679a004llvm/test/CodeGen/AMDGPU hazard-gfx1250-flat-scr-hi.mir

[AMDGPU] Avoid undefs in hazard-gfx1250-flat-scr-hi.mir. NFC
DeltaFile
+57-32llvm/test/CodeGen/AMDGPU/hazard-gfx1250-flat-scr-hi.mir
+57-321 files

LLVM/project 83ab875llvm/lib/Target/AMDGPU SIInstrInfo.cpp, llvm/test/CodeGen/AMDGPU hazard-gfx1250-flat-scr-hi.mir

[AMDGPU] Handle phys regs in flat_scratch_base_hi operand check (#170395)

DeltaFile
+13-0llvm/test/CodeGen/AMDGPU/hazard-gfx1250-flat-scr-hi.mir
+2-1llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
+15-12 files

LLVM/project 271e99dclang/lib/CIR/CodeGen CIRGenBuiltinX86.cpp, clang/test/CIR/CodeGenBuiltins/X86 xop-builtins.c avx512f-builtins.c

[CIR] Support x86 builtin rotate (#169566)

This PR implements CodeGen for rotate builtins in CIR upstream.
Issue https://github.com/llvm/llvm-project/issues/167765
DeltaFile
+92-0clang/test/CIR/CodeGenBuiltins/X86/xop-builtins.c
+42-0clang/lib/CIR/CodeGen/CIRGenBuiltinX86.cpp
+33-0clang/test/CIR/CodeGenBuiltins/X86/avx512f-builtins.c
+167-03 files

LLVM/project 7685e1flldb/unittests/SymbolFile/DWARF DWARFASTParserClangTests.cpp

[lldb][test] DWARFASTParserClangTests: extract test setup into helper structure (#170132)

Depends on:
* https://github.com/llvm/llvm-project/pull/170249

We keep repeating the boilerplate of creating a
`DWARFASTParserClangStub` and `TypeSystemClangHolder` in all the
unit-test cases. Lets extract this into a helper to make the tests
easier to grok.

We actually only need the `DWARFASTParserClangStub` and a
`TypeSystemClangHolder` in one of the test cases. For the rest, we can
just re-use the typesystem/parser that the `YAMLModuleTester` created.
Re-using them makes it more straightforward to write test-cases because
we don't need to worry about which TypeSystem which DWARFParser created
types into.
DeltaFile
+109-166lldb/unittests/SymbolFile/DWARF/DWARFASTParserClangTests.cpp
+109-1661 files

LLVM/project 9d6d284clang/test/CodeGen/arm-mve-intrinsics ternary.c, llvm/test/CodeGen/AArch64 fixed-length-bf16-arith.ll

Address review comments

Created using spr 1.3.6-beta.1
DeltaFile
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+3,019-0llvm/test/Transforms/AggressiveInstCombine/umulh_carry4.ll
+1,499-0llvm/test/CodeGen/ARM/fp-intrinsics-vector.ll
+692-320clang/test/CodeGen/arm-mve-intrinsics/ternary.c
+936-0llvm/test/CodeGen/AArch64/fixed-length-bf16-arith.ll
+905-0llvm/test/CodeGen/X86/combine-icmp.ll
+11,776-3201,592 files not shown
+70,461-17,0841,598 files

LLVM/project 70efdedclang/test/CodeGen/arm-mve-intrinsics ternary.c, llvm/test/CodeGen/AArch64 fixed-length-bf16-arith.ll

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.6-beta.1

[skip ci]
DeltaFile
+4,725-0llvm/test/tools/llvm-mca/RISCV/SpacemitX60/vlseg-vsseg.s
+3,019-0llvm/test/Transforms/AggressiveInstCombine/umulh_carry4.ll
+1,499-0llvm/test/CodeGen/ARM/fp-intrinsics-vector.ll
+692-320clang/test/CodeGen/arm-mve-intrinsics/ternary.c
+936-0llvm/test/CodeGen/AArch64/fixed-length-bf16-arith.ll
+905-0llvm/test/CodeGen/X86/combine-icmp.ll
+11,776-3201,591 files not shown
+70,451-17,0791,597 files

LLVM/project f0cd274llvm/lib/IR ReplaceConstant.cpp, llvm/test/CodeGen/AMDGPU lower-module-lds-constantexpr.ll same-lds-variable-multiple-use-in-one-phi-node.ll

[𝘀𝗽𝗿] initial version

Created using spr 1.3.6-beta.1
DeltaFile
+47-41llvm/test/CodeGen/AMDGPU/lower-module-lds-constantexpr.ll
+51-0llvm/test/CodeGen/AMDGPU/same-lds-variable-multiple-use-in-one-phi-node.ll
+26-17llvm/test/CodeGen/AMDGPU/lower-kernel-lds-constexpr.ll
+16-5llvm/lib/IR/ReplaceConstant.cpp
+140-634 files

LLVM/project ac19d38lldb/source/Plugins/SymbolFile/DWARF SymbolFileDWARF.cpp DWARFASTParserClang.cpp, lldb/unittests/Symbol TestClangASTImporter.cpp

[lldb][DWARFASTParserClang] Complete and make use of LLVM's RTTI support (#170249)

We almost had RTTI support for `DWARFASTParserClang`, but because
`classof` was protected, using `llvm::cast`/etc. on it would fail to
compile with:
```
llvm/include/llvm/Support/Casting.h:64:57: error: 'classof' is a protected member of 'DWARFASTParserClang'
   64 |   static inline bool doit(const From &Val) { return To::classof(&Val); }
      |                                                         ^
llvm/include/llvm/Support/Casting.h:110:32: note: in instantiation of member function 'llvm::isa_impl<DWARFASTParserClang, lldb_private::plugin::dwarf::DWARFASTParser>::doit' requested here
  110 |     return isa_impl<To, From>::doit(*Val);
```

This patch makes `classof` public and turns `static_cast`s of
`DWARFASTParserClang` into `llvm::cast`s.
DeltaFile
+5-6lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
+4-6lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.cpp
+5-4lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.h
+2-2lldb/unittests/SymbolFile/DWARF/DWARFASTParserClangTests.cpp
+1-1lldb/unittests/Symbol/TestClangASTImporter.cpp
+17-195 files

LLVM/project dea86c6llvm/test/CodeGen/AMDGPU occupancy-levels.ll

[AMDGPU][NFC] Add occupancy checks for gfx950 and gfx1250 (#170392)

DeltaFile
+82-0llvm/test/CodeGen/AMDGPU/occupancy-levels.ll
+82-01 files

LLVM/project b30a48clldb/test/API/functionalities/scripted_frame_provider TestScriptedFrameProvider.py

[lldb/test] Fix scripted frame provider tests on ARM32

On ARM32, FixCodeAddress unconditionally clears bit 0 (the Thumb bit)
from all code addresses, including synthetic frame PCs. This causes
test failures where synthetic PCs like 0xFFFF and 0xDEADBEEF become
0xFFFE and 0xDEADBEEE respectively.

This adjusts the tests to expect the modified PC values on ARM32.

Signed-off-by: Med Ismail Bennani <ismail at bennani.ma>
DeltaFile
+13-4lldb/test/API/functionalities/scripted_frame_provider/TestScriptedFrameProvider.py
+13-41 files

LLVM/project e05fffbclang/include/clang/Basic BuiltinTemplates.td, clang/lib/Sema SemaTemplate.cpp SemaTypeTraits.cpp

Revert "[Clang] Add __builtin_common_reference (#121199)"

This reverts commit 3b9e203364dcd8234b12eb447ddbcf97a877558c.
Causes not-yet-understood semantic differences, see commits
on #121199.
DeltaFile
+19-341clang/lib/Sema/SemaTemplate.cpp
+0-136clang/test/SemaCXX/type-trait-common-reference.cpp
+87-5clang/lib/Sema/SemaTypeTraits.cpp
+0-77clang/lib/Sema/SemaType.cpp
+14-28libcxx/include/__type_traits/common_reference.h
+2-28clang/include/clang/Basic/BuiltinTemplates.td
+122-6153 files not shown
+122-6479 files

LLVM/project 78f3a2allvm/test/CodeGen/AMDGPU hazard-gfx1250-flat-scr-hi.mir

[AMDGPU] Avoid undefs in hazard-gfx1250-flat-scr-hi.mir. NFC
DeltaFile
+54-30llvm/test/CodeGen/AMDGPU/hazard-gfx1250-flat-scr-hi.mir
+54-301 files

LLVM/project c5e9289llvm/test/tools/llvm-exegesis/RISCV/rvv filter.test

[llvm-exegesis] Make rvv/filter.test deterministic

This should prevent the flaky failures that have been plaguing the
buildbots since the test was introduced and allow for offline
investigation without disrupting CI.

Reviewers: topperc, mshockwave

Reviewed By: mshockwave

Pull Request: https://github.com/llvm/llvm-project/pull/170014
DeltaFile
+5-3llvm/test/tools/llvm-exegesis/RISCV/rvv/filter.test
+5-31 files

LLVM/project 325a082lldb/include/lldb/API SBTrace.h

[lldb] Fix Doxygen warning in SBTrace.h (#170394)

Remove errant `\a` command before `<directory>` in `SaveToDisk`
documentation. The `\a` Doxygen command expects a word argument, but
`<directory>` starts with `<` which Doxygen interprets as HTML. This
fixes:

```
llvm-project/lldb/include/lldb/API/SBTrace.h:60:
Warning 564: Error parsing Doxygen command a: No word followed the command. Command ignored.
```
DeltaFile
+1-1lldb/include/lldb/API/SBTrace.h
+1-11 files

LLVM/project 94c8940lldb/docs/resources lldbgdbremote.md

lldbgdbremote.md: Update `qWasmLocal` result description (#170393)

The current description mistakenly specified that an address of a local
value in some address space is returned. When testing this with Wasm
runtimes that already implement this command, it can be observed that
the value itself is returned. The value itself may be an address for
languages that use shadow stack in Wasm linear memory, but the value of
an arbitrary local does not always contain that address.
DeltaFile
+3-3lldb/docs/resources/lldbgdbremote.md
+3-31 files

LLVM/project 662e38cmlir/include/mlir/Dialect/XeGPU/IR XeGPUAttrs.td, mlir/lib/Dialect/XeGPU/IR XeGPUDialect.cpp

broadcast 1d/scalar to 2d, propagation and sg distribution
DeltaFile
+125-3mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp
+61-0mlir/test/Dialect/XeGPU/subgroup-distribute.mlir
+25-1mlir/lib/Dialect/XeGPU/Transforms/XeGPUPropagateLayout.cpp
+21-0mlir/lib/Dialect/XeGPU/IR/XeGPUDialect.cpp
+12-2mlir/include/mlir/Dialect/XeGPU/IR/XeGPUAttrs.td
+244-65 files

LLVM/project a9dcbf8llvm/lib/Target/AMDGPU SIInstrInfo.cpp, llvm/test/CodeGen/AMDGPU hazard-gfx1250-flat-scr-hi.mir

[AMDGPU] Handle phys regs in flat_scratch_base_hi operand check
DeltaFile
+13-0llvm/test/CodeGen/AMDGPU/hazard-gfx1250-flat-scr-hi.mir
+2-1llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
+15-12 files

LLVM/project 9fd288eclang/lib/Basic/Targets AMDGPU.h, clang/test/CodeGenOpenCL address-spaces.cl builtins-alloca.cl

clang/AMDGPU: Enable opencl 2.0 features for unknown target (#170308)

Assume amdhsa triples support flat addressing, which matches
the backend logic for the default target. This fixes the
rocm device-libs build.
DeltaFile
+13-1clang/lib/Basic/Targets/AMDGPU.h
+3-2clang/test/CodeGenOpenCL/address-spaces.cl
+2-2clang/test/CodeGenOpenCL/builtins-alloca.cl
+3-0clang/test/Misc/amdgcn.languageOptsOpenCL.cl
+21-54 files

LLVM/project f951d6dclang/lib/Sema SemaDeclAttr.cpp

Return nullptr in mergeModularFormatAttr
DeltaFile
+1-3clang/lib/Sema/SemaDeclAttr.cpp
+1-31 files

LLVM/project 9dd3346llvm/lib/Target/AMDGPU SIInstrInfo.cpp GCNSubtarget.h, llvm/test/CodeGen/AMDGPU hazard-gfx1250-flat-scr-hi.mir

[AMDGPU] Prevent folding of flat_scr_base_hi into a 64-bit SALU (#170373)

Fixes: SWDEV-563886
DeltaFile
+145-0llvm/test/CodeGen/AMDGPU/hazard-gfx1250-flat-scr-hi.mir
+11-0llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
+6-0llvm/lib/Target/AMDGPU/GCNSubtarget.h
+162-03 files

LLVM/project dd1b4abclang/lib/CodeGen CGExprScalar.cpp, clang/lib/Sema SemaOverload.cpp

[HLSL][Matrix] Add support for Matrix element and trunc Casts (#168915)

fixes #168737
fixes #168755

This change fixes adds support for Matrix truncations via the
ICK_HLSL_Matrix_Truncation enum. That ends up being most of the files
changed.

It also allows Matrix as an HLSL Elementwise cast as long as the cast
does not perform a shape transformation ie 3x2 to 2x3.

Tests for the new elementwise and truncation behavior were added. As
well as sema tests to make sure we error n the shape transformation
cast.

I am punting right now on the ConstExpr Matrix support. That will need
to be addressed later. Will file a seperate issue for that if reviewers
agree it can wait.
DeltaFile
+293-0clang/test/SemaHLSL/MatrixElementOverloadResolution.hlsl
+219-0clang/test/CodeGenHLSL/BasicFeatures/MatrixElementTypeCast.hlsl
+156-0clang/test/CodeGenHLSL/BasicFeatures/MatrixExplicitTruncation.hlsl
+138-0clang/test/CodeGenHLSL/BasicFeatures/MatrixImplicitTruncation.hlsl
+67-6clang/lib/Sema/SemaOverload.cpp
+58-4clang/lib/CodeGen/CGExprScalar.cpp
+931-1021 files not shown
+1,084-2327 files

LLVM/project 45918f5llvm/include/llvm/ADT SetVector.h, llvm/unittests/ADT SetVectorTest.cpp

[llvm][NFC] In `SetVector`, `contains` and `count` now automatically accept `const T *` arguments when the key is `T *` (#170377)

Also use `is_contained` to implement `contains`, since this tries the
`contains` member function of the set type first.
DeltaFile
+7-5llvm/include/llvm/ADT/SetVector.h
+1-1llvm/unittests/ADT/SetVectorTest.cpp
+8-62 files

LLVM/project 6c32535clang/lib/Sema AnalysisBasedWarnings.cpp, clang/lib/StaticAnalyzer/Core BugReporter.cpp

[clang][NFC] Remove unused CFGStmtMap.h includes (#170383)

DeltaFile
+0-1clang/lib/Sema/AnalysisBasedWarnings.cpp
+0-1clang/lib/StaticAnalyzer/Core/BugReporter.cpp
+0-22 files

LLVM/project e9c1274llvm/lib/Transforms/IPO LowerTypeTests.cpp, llvm/test/Transforms/LowerTypeTests function.ll

[LTT] mark the CFI jumptable naked on Windows (#170371)

We were not marking the `.cfi.jumptable`​ functions as `naked`​ on windows. The referenced bug (https://llvm.org/bugs/show_bug.cgi?id=28641#c3) appears to be fixed:

```bash
build/bin/opt -S -passes=lowertypetests -mtriple=i686-pc-win32 llvm/test/Transforms/LowerTypeTests/function.ll | build/bin/llc -O0
```

```
L_.cfi.jumptable:                       # @.cfi.jumptable
# %bb.0:                                # %entry
        #APP
        jmp     _f.cfi at PLT
        int3
        int3
        int3

        #NO_APP
        #APP

    [12 lines not shown]
DeltaFile
+5-6llvm/test/Transforms/LowerTypeTests/function.ll
+1-6llvm/lib/Transforms/IPO/LowerTypeTests.cpp
+6-122 files

LLVM/project 82158d6llvm/lib/Transforms/Vectorize SLPVectorizer.cpp

Add assertion

Created using spr 1.3.7
DeltaFile
+1-0llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
+1-01 files

LLVM/project ffc8742clang/include/clang/Basic AttrDocs.td, clang/include/clang/Sema Sema.h

Allow only duplicates, whether on the same decl or redecls
DeltaFile
+34-26clang/lib/Sema/SemaDeclAttr.cpp
+10-10clang/test/CodeGen/attr-modular-format.c
+9-4clang/test/Sema/attr-modular-format.c
+5-0clang/lib/Sema/SemaDecl.cpp
+5-0clang/include/clang/Sema/Sema.h
+4-0clang/include/clang/Basic/AttrDocs.td
+67-401 files not shown
+69-407 files

LLVM/project 6bdb838clang/lib/CIR/CodeGen CIRGenBuiltinX86.cpp, clang/test/CIR/CodeGenBuiltins/X86 avx-builtins.c avx512bw-builtins.c

[CIR] Upstream vec shuffle builtins in CIR codegen (#169178)

This PR is part of #167752. It upstreams the codegen and tests for the
shuffle builtins implemented in the incubator, including:
- `vinsert` + `insert`
- `pblend` + `blend`
- `vpermilp`
- `pshuf` + `shufp`
- `palignr`

It does NOT upstream the `perm`, `vperm2`, `vpshuf`, `shuf_i` / `shuf_f`
and `align` builtins, which are not yet implemented in the incubator.

This _is_ a large commit, but most of it is tests.

The `pshufd` / `vpermilp` builtins seem to have no test coverage in the
incubator, what should I do?
DeltaFile
+89-5clang/lib/CIR/CodeGen/CIRGenBuiltinX86.cpp
+73-1clang/test/CIR/CodeGenBuiltins/X86/avx-builtins.c
+50-10clang/test/CIR/CodeGenBuiltins/X86/avx512bw-builtins.c
+53-2clang/test/CIR/CodeGenBuiltins/X86/sse2-builtins.c
+53-0clang/test/CIR/CodeGenBuiltins/X86/avx2-builtins.c
+48-0clang/test/CIR/CodeGenBuiltins/X86/avx512f-builtins.c
+366-182 files not shown
+395-198 files

LLVM/project 9c78bc5llvm/test/CodeGen/AMDGPU amdgpu-codegenprepare-idiv.ll fshr.ll, llvm/test/Transforms/LoadStoreVectorizer/AMDGPU merge-vectors-complex.ll

Revert "[LSV] Merge contiguous chains across scalar types" (#170381)

Reverts llvm/llvm-project#154069. I pointed out a number of issues
post-merge, most importantly examples of miscompiles:
https://github.com/llvm/llvm-project/pull/154069#issuecomment-3603854626.

While the motivation of the change is clear, I think the implementation
approach is flawed. It seems like the goal is to allow elements like
`load <2xi16>` and `load i32` to be vectorized together despite the
current algorithm not grouping them into the same equivalence classes. I
personally think that if we want to attempt this it should be a more
wholistic approach, maybe even redefining the concept of an equivalence
class. This current solution seems like it would be really hard to do
bug-free, and even if the bugs were not present, it is only able to
merge chains that happen to be adjacent to each other after
`splitChainByContiguity`, which seems like it is leaving things up to
chance whether this optimization kicks in. But we can discuss more in
the re-land. Maybe the broader approach I'm proposing is too difficult,
and a narrow optimization is worthwhile. Regardless, this should be
reverted, it needs more iteration before it is correct.
DeltaFile
+798-831llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-idiv.ll
+206-197llvm/test/CodeGen/AMDGPU/fshr.ll
+162-171llvm/test/CodeGen/AMDGPU/fdiv.ll
+66-258llvm/test/Transforms/LoadStoreVectorizer/AMDGPU/merge-vectors-complex.ll
+120-192llvm/test/CodeGen/AMDGPU/llvm.amdgcn.mfma.gfx950.ll
+149-141llvm/test/CodeGen/AMDGPU/fshl.ll
+1,501-1,79044 files not shown
+2,779-3,99250 files

LLVM/project e5f1d02clang/lib/CIR/Dialect/Transforms LoweringPrepare.cpp, clang/test/CIR/CodeGen cxx-special-member-attr.cpp struct.cpp

[CIR] Lower calls to trivial copy constructor to cir::CopyOp (#168281)

This PR is a follow up to #167975 and replaces calls to trivial copy
constructors with `cir::CopyOp`.

---------

Co-authored-by: Andy Kaylor <akaylor at nvidia.com>
Co-authored-by: Henrich Lauko <henrich.lau at gmail.com>
DeltaFile
+21-14clang/test/CIR/CodeGen/cxx-special-member-attr.cpp
+23-1clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp
+8-10clang/test/CIR/CodeGen/struct.cpp
+8-7clang/test/CIR/CodeGenOpenACC/compute-firstprivate-clause.cpp
+6-6clang/test/CIR/CodeGenOpenACC/combined-firstprivate-clause.cpp
+3-3clang/test/CIR/CodeGenOpenACC/firstprivate-clause-recipes.cpp
+69-412 files not shown
+73-458 files