[GitHub] React to /test-suite comment (#203151)
So the user knows the workflow has kicked off. I've put it in a separate
job with write permissions so the main job should still only have a read
only token.
Reapply "[NVPTX] Support lowering of `(l)lround`" (#202876)
Reverts llvm/llvm-project#202500
Original PR llvm/llvm-project#183901 was mistakenly reverted due to an
unrelated build failure.
[MLIR] Use internal shell for standalone tests (#203134)
The external shell will be removed soon
(https://discourse.llvm.org/t/rfc-removal-of-the-lit-external-shell/90951),
and this is one of the places where it hasn't been enabled by default.
There are no test failures caused by this, so we can just turn it on by
not explicitly setting execute_external as it defaults to False.
Revert "[Compiler-rt][test] Fix circular link dependency between builtins and libc" (#203152)
Reverts llvm/llvm-project#199482 due to failures when it's used on
platforms with non-ELF linkers. The patch needs additional guards, but
it's not immediately clear which platform linkers support the required
options.
[CodeGen] Support --print-changed for legacy codegen IR passes (#202252)
--print-changed is only wired into MachineFunctionPass (
https://reviews.llvm.org/D133055), so the IR-level passes in the codegen
pipeline (atomic-expand, codegenprepare, etc.) are not reported.
Report them from FPPassManager/MPPassManager instead, via a new
Pass::printIRUnit hook that MachineFunctionPass overrides to print MIR.
Analyses are skipped, matching the new pass manager.
Aided by Claude Opus 4.8
[RISC-V] Add --implicit-check-not="error:" to a few tests
Ensures that the test checks for every error emitted by llvm-mc. To do this
we have to move the CHECK lines to the next line rather than the same line
since otherwise we get a false-positive match.
This adds a few missing CHECK line in the xqcibm-invalid test and is needed
to minimize the diff in one of my subsequent commit.
Pull Request: https://github.com/llvm/llvm-project/pull/203091
[MLIR][XeGPU] Support partial subgroup lane distribution (#201667)
for convert_layout
Add lowering support in XeGPUSgToLaneDistribute for values that are
distributed across only a fraction of the subgroup.
- SgToLaneConvertLayout now lowers a rank-2 xegpu.convert_layout that
shrinks the lane layout along the outer (distributed) dimension while
keeping lane_data unchanged (e.g. [16, 1] -> [8, 1]). The partial-subgroup
case is detected directly in the pattern: equal order, rank 2, unit inner
lane layout, and a genuinely distributed outer lane layout (> 1, which also
rules out the degenerate [1, 1] layout). Because the data is no longer
replicated in every lane, it is gathered across lanes and the distributed
outer dimension is doubled when the lane count is halved.
- The cross-lane gather is factored into a dedicated helper,
shuffleDataAsLaneLayoutChange(): it bitcasts the source to i32, issues
gpu.shuffle up to fetch the values from the dropped lanes, and concatenates
[9 lines not shown]