[SelectionDAG] Fix CLMULR/CLMULH expansion (#183537)
For v8i8 on AArch64, `expandCLMUL` picked the zext path (ExtVT=v8i16) since ZERO_EXTEND/SRL were legal, but CLMUL on v8i16 is not, resulting in a bit-by-bit expansion (~42 insns). Prefer the bitreverse path when CLMUL is legal on VT but not ExtVT.
v8i8 CLMULR: 42 → 4 instructions.
Fixes #182780
[MLIR][Vector] Enhance shape_cast unrolling support in case the target shape is [1, 1, ..1] (#183436)
This PR fixes a minor issue in shape_cast unrolling: when all target
dimensions are unit-sized, it no longer removes all leading unit
dimensions.
[MIR] Error on signed integer in getUnsigned (#183171)
Previously we effectively took the absolute value of the APSInt, instead
diagnose the unexpected negative value.
Change-Id: I4efe961e7b29fdf1d5f97df12f8139aac12c9219
coturn: updated to 4.9.0
4.9.0
Changelist:
- Fix thread sanitizer warning
- OpenSSL: migrate to modern API for DH param
- Migrate AES-128 encryption from deprecated OpenSSL API to EVP
- Fix unbounded strcpy in stun_method_str (ns_turn_msg.c)
- Fix missing null termination in addr_to_string (ns_turn_ioaddr.c)
- Fix missing null termination after strncpy in DB drivers
- Fix unbounded strcpy in HTTP response (http_server.c)
- Remove ubuntu20, DEBUG builds from build matrix
- Fix buffer overflow in decryptPassword in mysql
- Fix missing null termination after strncpy in MongoDB driver (dbd_mongo.c)
- Fix buffer overflow in decrypt_aes_128
- Fix Web Admin Password Check
py-scikit-build-core: updated to 0.12.1
0.12.1
This fixes a regression on Windows when targeting 32-bit wheels on a 64-bit
host.
Fixes:
- Use `CMAKE_GENERATOR`(`_PLATFORM`) on Windows again
Documentation:
- Use newly fixed `click-extra` for admonitions
- show CI state of the main branch in the badge
[AMDGPU][Scheduler] Add `GCNRegPressure`-based methods to `GCNRPTarget` (#182853)
This adds a few methods to `GCNRPTarget` that can estimate/perform RP
savings based on `GCNRegPressure` instead of a single `Register`,
opening the door to model/incorporate more complex savings made up of
multiple registers of potentially different classes. The scheduler's
rematerialization stage now uses this new API.
Although there are no test changes this is not really NFC since register
pressure savings in the rematerialization stage are now computed through
`GCNRegPressure` instead of the stage itself. If anything this makes
them more consistent with the rest of the RP-tracking infrastructure.