[mlir][nvgpu] Add roundtrip tests for warpgroup MMA operations (#199272)
Add roundtrip tests for the following NVGPU dialect operations:
- `nvgpu.warpgroup.mma.init.accumulator`
- `nvgpu.warpgroup.mma.store`
These operations were present in the dialect but lacked roundtrip
test coverage in `roundtrip.mlir`. The tests verify that operand
and result types survive the parser/printer roundtrip.
[IR] BlockAddress doesn't use BasicBlock (#200772)
BlockAddress is the only non-terminator user of a BasicBlock, and it
occurs very rarely. To speed up predecessor iteration, change
BlockAddress to no longer use its BasicBlock.
This should also make uselistorder_bb obsolete.
py-idna: update to 3.17.
## 3.17 (2026-05-28)
- Substantial 75% reduction in memory usage through new data
structures and some optimization in processing speed.
- Added a general 1024-character input length cap to the public
validation, conversion, and codec entry points. This is well above
any legitimate domain or label and guards against pathological
inputs.
[flang] Add support for TYPEOF and CLASSOF type specifiers (#188804)
Implements parsing, semantics and lowering for the Fortran 2023 TYPEOF and CLASSOF type specifiers (R703). TYPEOF produces the declared type of a data-ref; CLASSOF produces a polymorphic version. Includes constraint
checks (C709–C713) and tests.
Semantics resolves TYPEOF/CLASSOF to the concrete underlying type in the symbol table, so no lowering code changes were needed. Added a lowering test to verify FIR/HLFIR generation works correctly for intrinsic types,
derived types, extended types, and polymorphic CLASSOF with allocatable/pointer.
Fixes - https://github.com/llvm/llvm-project/issues/185635
py-docutils: update to 0.23.
Release 0.23 (2026-05-27)
=========================
General:
- Define `public API and backwards compatibility policy`_.
rST parser:
- Problems with the "include" directive are reported as ERROR, not SEVERE.
- The "include" directive options :start-after: and :end-before: may now
also be used without value (standing for an empty line).
- The highlight language of a custom role based on the `"code" role`_
defaults to the role's name (if supported by Pygments_).
Specifying ``:language: none`` turns off syntax highlight.
HTML5 writer:
- If a section has several IDs, use the last one (from the first
preceding `explicit target`__) as self-link_.
[31 lines not shown]
[Clang][HIP] Guard declarations of cmath comparisons when using Microsoft's STL
In HIP, constexpr functions are treated as both, __host__ and
__device__.
A new version of the MS STL shipped with the build tools version 14.51.36231
has constexpr definitions for some cmath functions when the compiler in use is Clang.
These definitions conflict with the __device__ delcarations we provide in
the header wrappers.
This patch guards these declarations/definitions to use the ones coming
from the STL.
[GlobalISel] Do not use recordsOperand() to check if a Combiner C++ predicate can be hoisted
The combiner does not use RecordNamedOperand, so assume that any operand is "recorded" in the sense
that can be used by a C++ predicate.
Wait for bdev to actually disappear after delete (#19042)
spdk_bdev_unregister's cb_fn (which sends the RPC reply) fires before
spdk_bdev_close(desc) in spdk_bdev_unregister_by_name, so
bdev_aio_delete can return while the underlying close(fd) on the zvol is
still pending.
(cherry picked from commit c0e0111ef4547c42fc40a1677cf5b1691c2e57cf)
NAS-141202 / 27.0.0-BETA.1 / Wait for bdev to actually disappear after delete (#19042)
spdk_bdev_unregister's cb_fn (which sends the RPC reply) fires before
spdk_bdev_close(desc) in spdk_bdev_unregister_by_name, so
bdev_aio_delete can return while the underlying close(fd) on the zvol is
still pending.
[clang][SemaCXX] Fix crash caused by unresolved overloaded function type when using `__builtin_bit_cast` (#200574)
Resolves #200112
By early checking for placeholder expressions, crash can be avoided for
unreachable builtin type when fetching type info.
[X86] Fold XOR of VGF2P8AFFINEQB and its source (#198448)
Adds a optimization to fold XORs between `vgf2p8affineqb` and its
source. That XOR can done through the existing affine transformation by
XORing its matrix with the identity matrix. This patch:
- Folds XOR between `vgf2p8affineqb` and its source.
- Only occurs when the matrix is constant, ensuring that it can't
increase the dependency chain.
- Doesn't occur if the affine is multi use, preventing an increase in
code size.
- Includes test coverage for both positive and negative cases.
Fixes #184061