[AMDGPU] Add dot product patterns with saturating add (clamp) (#187945)
Add pattern matching for dot product operations combined with saturating
add intrinsics (llvm.uadd.sat / llvm.sadd.sat). This enables the
compiler to generate dot instructions with the clamp modifier instead of
separate dot + saturating add instructions.
Fixes #182095
## Changes
- Added UDot2SatPat and SDot2SatPat TableGen pattern classes that match
uaddsat/saddsat with dot2 computations
- Added performSatAddCombine DAG combiner function to handle
ISD::UADDSAT and ISD::SADDSAT nodes
- Added test file idot2-sat.ll
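
For reference, the unfused computation that the new patterns look for can be modelled in plain C. This is only a sketch of the unsigned case (llvm.uadd.sat applied to a two-element 16-bit dot product). The helper names are hypothetical, and it does not reproduce the TableGen patterns, the DAG combine, or the exact behaviour of the hardware clamp modifier.

```c
#include <assert.h>
#include <stdint.h>

/* Reference model of llvm.uadd.sat.i32: add, clamping to UINT32_MAX on overflow. */
static uint32_t uadd_sat_u32(uint32_t a, uint32_t b)
{
    uint32_t sum = a + b;               /* unsigned add wraps modulo 2^32 */
    return sum < a ? UINT32_MAX : sum;  /* a wrapped sum is smaller than either operand */
}

/*
 * Hypothetical helper: the unfused form, a 2-element dot product of
 * 16-bit lanes followed by a saturating add of the accumulator.
 * The sum of the two products is a plain 32-bit add here and may wrap;
 * how the real instruction treats that case is not modelled.
 */
static uint32_t dot2_then_uadd_sat(const uint16_t a[2], const uint16_t b[2],
    uint32_t acc)
{
    assert(a && b);
    uint32_t dot = (uint32_t)a[0] * b[0] + (uint32_t)a[1] * b[1];
    return uadd_sat_u32(acc, dot);
}
```

With a = {2, 3}, b = {4, 5} and acc = 10 the result is 33. With acc close to UINT32_MAX the result clamps to UINT32_MAX instead of wrapping, which is the behaviour the clamp modifier is meant to provide in a single instruction.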
## Example
[16 lines not shown]
Introduce and use dtls12_handshake_msg.
Add struct dtls12_handshake_msg and various related functions, which
allow for the construction of DTLS handshake messages and associated
fragments.
Use this on the DTLS write path for sending handshake message fragments.
This means that we no longer modify the init buffer, which also fixes a
bug where the message callback is called with a corrupted handshake
message when multiple fragments have been sent.
We also now correctly track fragment offsets when sending a handshake
message that results in multiple calls to dtls1_do_write_handshake_message().
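
The idea of building each fragment from an unmodified message while tracking the fragment offset can be sketched as follows. This is an illustration only: struct hs_msg, put_u24() and hs_msg_write_fragment() are hypothetical names and are not the dtls12_handshake_msg API added by this commit. The 12-byte header layout (type, 24-bit length, 16-bit message sequence, 24-bit fragment offset, 24-bit fragment length) is the DTLS 1.2 handshake header.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define DTLS_HM_HEADER_LEN 12   /* DTLS 1.2 handshake header size */

/* Hypothetical message descriptor; the body is never modified. */
struct hs_msg {
    uint8_t type;
    uint16_t seq;
    const uint8_t *body;
    size_t body_len;
};

/* Store the low 24 bits of v in big-endian order. */
static void put_u24(uint8_t *p, size_t v)
{
    p[0] = (uint8_t)(v >> 16);
    p[1] = (uint8_t)(v >> 8);
    p[2] = (uint8_t)v;
}

/*
 * Write the fragment starting at frag_off into out, carrying at most
 * max_frag body bytes. Returns the number of bytes written (header plus
 * fragment), or 0 if out is too small. The fragment length is stored in
 * *frag_len so the caller can advance its own offset.
 */
static size_t hs_msg_write_fragment(const struct hs_msg *m, size_t frag_off,
    size_t max_frag, uint8_t *out, size_t out_len, size_t *frag_len)
{
    assert(frag_off <= m->body_len);

    size_t len = m->body_len - frag_off;
    if (len > max_frag)
        len = max_frag;
    if (out_len < DTLS_HM_HEADER_LEN + len)
        return 0;

    out[0] = m->type;
    put_u24(out + 1, m->body_len);      /* length of the whole message */
    out[4] = (uint8_t)(m->seq >> 8);
    out[5] = (uint8_t)m->seq;
    put_u24(out + 6, frag_off);         /* fragment offset */
    put_u24(out + 9, len);              /* fragment length */
    memcpy(out + DTLS_HM_HEADER_LEN, m->body + frag_off, len);

    *frag_len = len;
    return DTLS_HM_HEADER_LEN + len;
}
```

A caller would start at offset 0 and add the returned fragment length after each successful write until the offset reaches body_len. Because every fragment is assembled in a separate output buffer, the original message stays intact for a later message callback.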
This is the first step towards further untangling of the write path in
the legacy TLS stack.
ok kenjiro@ tb@
Revert "[clang][NFC] Mark CWG717 as implemented and add a test (#197732)" (#198074)
As reported in #197930, these new tests fail on the
`arm64-apple-darwin-unknown` target. There's not a consensus yet on how
to fix the breakage, so revert it until we can decide.
kernel: Use lockowned() instead of (lockstatus(&lock, curthread) != 0).
It is sufficient and more lightweight.
Most of these lockstatus() calls were added by me.
Discussed-with: aly
update to zsync-0.6.4
specific gzip support is deprecated and will be removed in a future version
(upstream suggests using gzip --rsyncable, if you have a version of gzip
which supports it; openbsd's doesn't)
neuter shared memory (shm*) in webrtc to unbreak screen sharing while using pledge
the same patch is used in chromium to take the XImage API codepath instead
of using shared memory which cannot be used while pledged
ok landry@
gstreamer1: updated to 1.28.3
GStreamer 1.28.3 stable bug fix release
Highlighted bugfixes:
Various security fixes and playback fixes
applemedia: vtdec stability, MoltenVK integration and planar video format handling fixes
audioresample: Fix regression on armv7hf
bpmdetect: Fixes for stereo and multi-channel modes
devicemonitor: wait for start thread to finish when listing devices so all the info is there for e.g. v4l2 provider
fallbacksrc: Add fallback-source and enable-dummy properties
nvidia: fix cudaconvert performance regression and nvdec device creation regression
opengl: add GBRA swizzle support, and fix glcolorconvert vertical flip issue on crop
rtspsrc: include user-agent property in HTTP tunnel requests and fix mikey regression
threadshare: add leaky mode to dataqueue-based elements
v4l2: fix negotiation error when trying to force stateful decoders to output dmabufs
webrtcsink: Add support for imx8mp vpuenc_hevc hardware H.265 encoder
cerbero: Extend gst-plugins-rs melding to Darwin platforms for smaller binary sizes and static linking improvements
[3 lines not shown]