[MIR] Support symbolic inline asm operands (#185893)
Support parsing and printing inline assembly operands in MIR using the
symbolic form instead of numeric register class IDs, thus removing the
need to update tests when the numbers change.
The numeric form remains supported.
Co-authored-by: Claude Opus 4.6 <noreply at anthropic.com>
[libc++] Make the associative container query benchmarks more representative (#183036)
Currently the query benchmarks train the branch predictor almost
perfectly, which isn't representative of real-world use. This change
causes the branch misses to go from <1% to ~50% with the current
implementation of `__tree::__find_end`.
This patch also removes the `non-existent` benchmarks, since it'd be
non-trivial to write a representative benchmark for that case, and the
benchmark would be relatively low value. We're already searching to leaf
nodes ~50% of the time (since half the nodes are leaves) with the
current benchmark. So we'd only additionally cover a relatively trivial
failure branch that is only taken once per function call. The loop is
already covered through benchmarking with keys existing in the
container.
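
As a rough illustration of why query order matters here, the sketch below
looks up existing keys in a shuffled order, so the left/right decisions
during tree descent are hard to predict. This is not the benchmark code
from the patch: `make_shuffled_keys` and `count_hits` are hypothetical
helpers, and the fixed seed is an assumption made for reproducibility.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <random>
#include <set>
#include <vector>

// Hypothetical helper: returns the keys 0..n-1 in a pseudo-random order.
std::vector<int> make_shuffled_keys(std::size_t n, unsigned seed) {
  std::vector<int> keys(n);
  std::iota(keys.begin(), keys.end(), 0);
  std::mt19937 rng(seed); // fixed seed keeps runs comparable
  std::shuffle(keys.begin(), keys.end(), rng);
  return keys;
}

// Hypothetical helper: looks up every query once and counts the hits.
// With shuffled queries, consecutive lookups take unrelated paths
// through the tree, so the comparison branches are poorly predicted.
std::size_t count_hits(const std::set<int> &s,
                       const std::vector<int> &queries) {
  std::size_t hits = 0;
  for (int key : queries)
    hits += (s.find(key) != s.end()) ? 1 : 0;
  assert(hits <= queries.size());
  return hits;
}
```

Querying the same container with sorted keys instead would give the
predictor a regular pattern to learn, which is the effect the commit
message describes.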
[mlir][tosa] Allow integer gather/scatter ops in fp profile (#183342)
This commit updates profile compliance to allow integer gather and
scatter operations to be used with the floating point profile. This
update aligns with the specification change:
https://github.com/arm/tosa-specification/pull/35.
[CIR] Implement zero-init-bases lowering (#186230)
This showed up in a test suite. A zero-initializer for a whole struct
seems completely sensible, as long as the type is zero-initializable.
This patch doesn't change the non-zero-init behavior (I am working on a
patch to do so, but it is a massive scope), so this is limited to JUST
classes with bases.
[VectorCombine] Fix crash in foldShuffleOfSelects for single-element shuffle result (#185713)
In foldShuffleOfSelects, if the shuffle result has a single element, the
resulting type may be a scalar rather than a vector. Later code in
foldShuffleOfSelects assumes the result is a vector and performs
cast<FixedVectorType>, which triggers an assertion.
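
The failure mode can be modelled without LLVM. The sketch below is a toy
analogue and not the actual fix: the `Type`, `ScalarType`, and
`FixedVectorType` classes are simplified stand-ins, `dynamic_cast` plays
the role of a checked `dyn_cast`, and `foldableElementCount` is a
hypothetical helper that bails out instead of asserting.

```cpp
#include <cassert>

// Toy stand-ins for the IR type hierarchy (not the LLVM classes).
struct Type {
  virtual ~Type() = default;
};
struct ScalarType : Type {};
struct FixedVectorType : Type {
  explicit FixedVectorType(unsigned n) : numElements(n) {}
  unsigned numElements;
};

// Hypothetical helper: returns the element count when the result type
// is a fixed vector, or 0 to signal that the fold should be skipped.
// An unconditional cast would instead fail on the scalar case.
unsigned foldableElementCount(const Type &resultTy) {
  const auto *vecTy = dynamic_cast<const FixedVectorType *>(&resultTy);
  if (!vecTy)
    return 0; // scalar result: bail out rather than assert
  assert(vecTy->numElements > 0 && "toy vectors are non-empty");
  return vecTy->numElements;
}
```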
Fixes #183625
[AMDGPU] Pass MF into the SIInsertWaitcnts constructor. NFC. (#186369)
Pass MF into the SIInsertWaitcnts constructor instead of the run method.
This is more natural now that SIInsertWaitcnts is constructed once per
MachineFunction and enables future cleanup by initializing more fields
in the constructor that depend on MF.
[mlir][Bytecode] Fix stale deferred worklist entries in attribute callback fallthrough (#186150)
When parseCustomEntry() calls a user attribute/type callback that
internally reads sub-attributes/types via the bytecode reader, the
reader may add entries to the deferredWorklist if the depth limit is
exceeded. If the callback then returns success with an empty entry
(falling through to the regular dialect reader), the reader position is
reset but deferredWorklist retains stale entries from the failed partial
read.
This causes an assert(deferredWorklist.empty()) failure in debug builds
when the fallback dialect reader successfully parses the attribute.
Fix by saving and restoring deferredWorklist.size() around each callback
invocation, discarding any stale entries added during a callback's
partial read when the reader position is rolled back.
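
The save/restore idea can be sketched in isolation as follows. This is a
toy model under stated assumptions, not the MLIR bytecode reader:
`ToyReader`, `Callback`, and `parseEntry` are hypothetical names, and the
worklist holds plain integers.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Toy stand-in for the reader state (not the MLIR API).
struct ToyReader {
  std::size_t position = 0;
  std::vector<int> deferredWorklist;
};

// A callback returns true and sets `value` when it handled the entry,
// or returns false to fall through to the next parser.
using Callback = std::function<bool(ToyReader &, int &)>;

// Tries each callback in turn. When a callback falls through, both the
// reader position and the worklist size are rolled back, so entries it
// deferred during its partial read do not leak into later parsing.
int parseEntry(ToyReader &reader, const std::vector<Callback> &callbacks,
               int fallbackValue) {
  for (const Callback &cb : callbacks) {
    const std::size_t savedPosition = reader.position;
    const std::size_t savedSize = reader.deferredWorklist.size();
    int value = 0;
    if (cb(reader, value))
      return value;
    // In this toy model callbacks only append to the worklist.
    assert(reader.deferredWorklist.size() >= savedSize);
    reader.position = savedPosition;
    reader.deferredWorklist.resize(savedSize); // drop stale entries
  }
  return fallbackValue; // stands in for the regular dialect reader
}
```

Truncating to the saved size keeps entries that were already pending
before the callback ran and discards only those added by the abandoned
read.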
Fixes #163337
Assisted-by: Claude Code
[mlir][shape] Fix crash when folding tensor.extract(shape_of(memref)) (#186270)
The `ExtractFromShapeOfExtentTensor` canonicalization pattern was
unconditionally rewriting:
tensor.extract(shape.shape_of(%arg), %idx) -> tensor.dim(%arg, %idx)
even when `%arg` is a memref. This produced an invalid `tensor.dim`
(whose source operand must be a tensor), which then caused an assertion
failure in `DimOp::getSource()` when subsequent canonicalization
patterns tried to match the op:
Assertion `isa<To>(Val) && "cast<Ty>() argument of incompatible type!"'
failed. [To = TypedValue<TensorType>, From = Value]
Fix: add an `IsTensorType` constraint to
`ExtractFromShapeOfExtentTensor` in `ShapeCanonicalization.td` so the
pattern only fires when `%arg` is a tensor type. The memref case is
intentionally left unfolded (the correct lowering to `memref.dim` would
[8 lines not shown]
[AMDGPU][GlobalIsel] Add register bank legalization rules for amdgcn sin cos intrinsics. (#185934)
This patch adds register bank legalization rules for amdgcn sin and cos
operations in the AMDGPU GlobalISel pipeline.
[X86] Remove single use assumption in combineVectorSizedSetCCEquality (#182200)
Always allow multiple uses of normal loads in mayFoldIntoVector:
scalarisation should break the vector load apart again if we fail to
use a vector op.
Closes llvm/llvm-project#164632
[NFC][AArch64] ConditionOptimizer Improve readability of cmp adjustment code (#185532)
Change the 'CmpInfo' tuple to a struct and rename adjustCmp() to
getAdjustedCmpInfo().
[clang][ssaf][NFC] Move SSAF from Analysis/Scalable/ to ScalableStaticAnalysisFramework/ (#186156)
- Rename `clang/{include,lib,unittests}/Analysis/Scalable/` to
`clang/{include,lib,unittests}/ScalableStaticAnalysisFramework/Core/`
- Update header-guards with their new paths
- Rename the library `clangAnalysisScalable` to
`clangScalableStaticAnalysisFrameworkCore`
- Add a new `Clang_ScalableStaticAnalysisFramework` module to
`module.modulemap`
- Update GN build files, GitHub PR labeler, and documentation
- Harmonise license comments
- Add a missing header-guard