[AMDGPU] Add WMMA and SWMMAC instructions for gfx1170 (#180731)
Introduce two new subtarget features:
- WMMA256bInsts for GFX11 WMMA instructions and
- WMMA128bInsts for GFX1170 and GFX12 WMMA and SWMMAC instructions
Some WMMA instructions have changed from GFX 11.0 to GFX 11.7 so new
Real versions were added with "_gfx1170" suffix. For consistency all
WMMA and SWMMAC GFX11.7 instructions use this suffix.
To resolve decoding issues between different formats for some WMMA
instructions between GFX 11 and GFX 11.7, new decoding tables were
added.
[clang] Ensure -mno-outline adds attributes (#163692)
Before this change, `-mno-outline` and `-moutline` only controlled the
pass pipelines for the invoked compiler/linker.
The drawback of this implementation is that, when using LTO, only the
flag provided to the linker invocation is honoured (and any files which
individually use `-mno-outline` will have that flag ignored).
This change serialises the `-mno-outline` flag into each function's
IR/Bitcode, so that we can correctly disable outlining from functions in
files which disabled outlining, without affecting outlining choices for
functions from other files. This matches how other optimisation flags
are handled so the IR/Bitcode can be correctly merged during LTO.
Reimport security/polarssl as security/mbedtls
mbedtls-3.6.5 is a SECURITY update over mbedtls-2.18.10 from
security/polarssl.
pkg/DESCR (slightly outdated):
PolarSSL is a fully featured and standards compliant SSL library offering
server and client functionality (SSLv3, TLSv1.0, TLSv1.1 and TLSv1.2) with
an intuitive API and readable source code. Dual license (GPLv2+/commercial).
Non-standard extensions include: SSL Session Tickets (RFC 5077), Server
Name Indication (SNI) (RFC 6066), Truncated HMAC (RFC 6066), Max Fragment
Length (RFC 6066), and Secure Renegotiation (RFC 5746).
Symmetric algorithms: AES, Blowfish, 3DES, DES, ARC4, Camellia, XTEA
Modes: ECB, CBC, CFB, CTR, GCM
Hashes: MD2, MD4, MD5, SHA-1, SHA-224, SHA-256, SHA-384, SHA-512, RIPEMD-160
PolarSSL has its own big number library. Its RSA implementation supports both
[8 lines not shown]
[CIR][NFC] Remove synthetic attribute from TryOp (#181915)
Remove the synthetic attribute from TryOp, which will not be needed
anymore and will be replaced by CleanupScopeOp
[lldb] Unify thread detaching by making HostThreadPosix override Reset() (#179470)
Remove the Detach() method from HostThreadPosix and instead override the
virtual Reset() function from HostNativeThreadBase, following the
pattern already used by HostThreadWindows. This provides a consistent
interface for platform-specific thread cleanup on all platforms.
The HostThreadPosix::Reset() implementation first calls pthread_detach()
before delegating to the base implementation
HostNativeThreadBase::Reset(), similar to how HostThreadWindows calls
CloseHandle() in its Reset() override.
This is a preparatory changes for https://github.com/llvm/llvm-project/pull/177572,
which addresses memory leakage in lldb-server. By unifying the logic of
detaching, it would be possible to introduce the behavioral change in a more
straightforward manner.
www/py-requests-ratelimiter: import new port (version 0.8.0)
This package is a simple wrapper around pyrate-limiter v2 that adds
convenient integration with the requests library.
OK: sthen@
[lldb][RISCV] update RISCV target features in disassembler (#173047)
This patch replaces the hardcoded RISCV feature flags in disassembler
with SubtargetFeatures provided by ArchSpec, which are derived from
the .riscv.attributes ELF section. This ensures the disassembler uses
exactly the RISCV extensions present in the executable, improving the
accuracy and reliability of the disassembly output.
[flang][acc] use new acc.reduction_combine under option (#182066)
Use the new acc.reduction_combine operation that was added in
https://github.com/llvm/llvm-project/pull/181853 when generating
reduction recipies.
To do this, the optional callback of genNoAliasArrayAssignment is
updated to work on addresses and being delegated the assignment part
too.
This change is guarded by two cl switch:
- openacc-use-reduction-combine: enable the change for
real/complex/integer reduction except min/max
- openacc-use-reduction-combine-all: enable the changed for all types
and operators.
devmatch: Add SPDX-License-Identifier tag
Reviewed by: emaste
Sponsored by: The FreeBSD Foundation
Differential Revision: https://reviews.freebsd.org/D55271
ng_ksocket: use new macros to lock socket buffers
PR: 292885
Reported by: Walker R. Thompson <walker.thompson at urz.uni-heidelberg.de>
(cherry picked from commit 1a3d1be4965afddded0b2582b9c4969c1e6a4129)
[emacs] Rework tablegen mode
This commit reworks tablegen-mode to be derived from prog-mode and
removes a lot of the manual work that define-derived-mode does for you
these days, along with fixing other lints (such as an over-long
summary).
This is a major version bump because td-decorators-face has been
renamed to tablegen-decorators-face in order to not pollute other
namespaces.
[NFC][emacs] Fix emacs lists in the LLVM and MLIR modes
This mainly involved explicitly declaring minimum emacs versions for
setq-local and adding a lexical-binding annotaton.
The commit also removes some workarounds from the MLIR mode for Emacs
23 (!).