[AMDGPU][doc] Refactor Barrier Execution Model (#204566)
Remove everything that has to do with named barriers and put it in a
series of model extensions specific to /sbarrier/named-barriers.
I had to change a few things to make it fit. In summary:
Base Model:
- (~) Stylistic changes that make it easier to refer to specific rules.
Each rule is in a rubric instead of a bullet point.
- (-) No longer defines `barrier-mutually-exclusive`
- (-) No longer defines barrier `join` and any associated rule.
New named barrier extensions
- (+) Define "named barrier" as a sub-type of barrier objects. This
makes barrier-mutually-exclusive redundant.
- (+) Define barrier join as an op that can exclusively be done on
[17 lines not shown]
[clang] Respect `CLANG_USE_EXPERIMENTAL_CONST_INTERP` (#200716)
Seems like https://github.com/llvm/llvm-project/pull/199396 had no
effect at all, even though the patch itself seems pretty obvious.
Change the semantics of the command-line option to support
`-fno-experimental-constant-interpreter` as well. This way, the cmake
option can be used to set the default and the `-f`/`-fno-` command-line
options can be used to override the default behavior.
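
The default-plus-override behaviour described above can be sketched as follows. This is a minimal illustration, not clang's driver code: `resolveBoolFlag` and its parameters are hypothetical names, and the sketch assumes the usual convention that the last `-f`/`-fno-` occurrence on the command line wins.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper (not a clang API): resolve a -f<name>/-fno-<name> pair
// against a build-time default, e.g. one chosen by a cmake option.
bool resolveBoolFlag(const std::vector<std::string> &args,
                     const std::string &name, bool buildDefault) {
  assert(!name.empty() && "flag name must not be empty");
  const std::string positive = "-f" + name;
  const std::string negative = "-fno-" + name;

  bool enabled = buildDefault; // used when neither spelling appears
  for (const std::string &arg : args) {
    if (arg == positive)
      enabled = true;  // later occurrences override earlier ones
    else if (arg == negative)
      enabled = false;
  }
  return enabled;
}
```

With a build default of `true`, passing only `-fno-experimental-constant-interpreter` yields `false`; with no flag at all, the build default is returned unchanged.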
[flang][PFT-to-MLIR] Wrap unstructured Fortran constructs in scf.execute_region
Extend the PFT-to-MLIR (HLFIR/FIR) lowering so unstructured DO and IF
constructs are emitted inside scf.execute_region, hiding their multi-block
CFG behind a single op. OpenACC and OpenMP lowerings that reject
multi-block content (e.g. the "unstructured do loop in combined acc
construct" TODO in OpenACC.cpp) now see a structured op instead.
Flag: -mmlir --wrap-unstructured-constructs-in-execute-region (default on).
An evaluation is wrappable iff all of the following hold:
* wrap flag on
* eval is parser::DoConstruct or parser::IfConstruct
* eval.isUnstructured
* branchesAreInternal(eval) -- every controlSuccessor in the subtree
targets a nested eval or the constructExit
* !hasIncomingBranch(eval) -- no outside eval branches into the body
(PFT's synthetic IfConstruct around `if(c) goto X` absorbs label
[14 lines not shown]
[orc-rt] Add SPS serialization for ExecutorAddrRange. (#205041)
Allows SPS serialization to/from ExecutorAddrRange. This will be used in
upcoming patches for compact-unwind registration support.
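
As a rough illustration of what serializing an address range involves, the sketch below round-trips a range through a byte buffer. It assumes the range is encoded as two consecutive 64-bit little-endian addresses (start, then end); that layout, the `AddrRange` struct, and all function names are assumptions made for this example, not the orc-rt API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for an executor address range: [Start, End).
struct AddrRange {
  std::uint64_t Start;
  std::uint64_t End;
};

// Append a 64-bit value in little-endian byte order.
void writeU64LE(std::vector<std::uint8_t> &out, std::uint64_t value) {
  for (int i = 0; i < 8; ++i)
    out.push_back(static_cast<std::uint8_t>(value >> (8 * i)));
}

// Read a 64-bit little-endian value at pos; returns false if fewer than
// 8 bytes remain, leaving pos unchanged.
bool readU64LE(const std::vector<std::uint8_t> &in, std::size_t &pos,
               std::uint64_t &value) {
  if (pos > in.size() || in.size() - pos < 8)
    return false;
  value = 0;
  for (int i = 0; i < 8; ++i)
    value |= static_cast<std::uint64_t>(in[pos + i]) << (8 * i);
  pos += 8;
  return true;
}

// Assumed layout: Start followed by End, 16 bytes in total.
std::vector<std::uint8_t> serializeRange(const AddrRange &range) {
  assert(range.Start <= range.End && "range must not be inverted");
  std::vector<std::uint8_t> out;
  writeU64LE(out, range.Start);
  writeU64LE(out, range.End);
  return out;
}

bool deserializeRange(const std::vector<std::uint8_t> &in, AddrRange &range) {
  std::size_t pos = 0;
  return readU64LE(in, pos, range.Start) && readU64LE(in, pos, range.End);
}
```

Deserialization reports failure on a truncated buffer instead of reading past the end, which is the property a wire format of this kind generally needs.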
[FIR] Route embox + projected complex slice through shapeVec
When the array_coor base is a fir.embox with a projected complex %re/%im
slice, take the shapeVec path instead of the descriptor (fir.box_dims)
path. The descriptor path iterates source-rank dims while querying the
rank-reduced embox result box, which miscompiles slices that collapse
dims (e.g. complex(:,k)%re). For embox-derived boxes the underlying
storage is contiguous, so the shape-derived layout is correct, and the
shapeVec path is the natural place to encode that a static shape is
available. Non-embox
boxes (rebox, assumed-shape) still go through fir.box_dims.
Co-Authored-By: Claude Sonnet 4.6 <noreply at anthropic.com>
[AArch64] Lower extends of boolean vector loads via scalar load (#203394)
Replace a `load <N x i1>` under a sext/zext with a scalar load +
bitcast, so the `combineToExtendBoolVectorInReg` helper can apply,
avoiding scalarization.
An optimisation for the SVE case with a predicate load will be added in
a follow-up.
Fixes #200325
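
The effect of the scalar load + bitcast followed by an extend can be modelled in plain C++. This is only a semantic sketch for the 8-lane case, not the AArch64 lowering itself: the function names are hypothetical, and it assumes the little-endian convention in which lane i of the `<8 x i1>` value sits in bit i of the loaded byte.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Model of sext(<8 x i1>) to <8 x i8>: each set bit becomes an all-ones lane.
// Assumption: lane i is stored in bit i of the scalar (little-endian layout).
std::array<std::int8_t, 8> sextBoolVector(std::uint8_t mask) {
  std::array<std::int8_t, 8> lanes{};
  for (int i = 0; i < 8; ++i)
    lanes[i] = ((mask >> i) & 1) ? std::int8_t(-1) : std::int8_t(0);
  return lanes;
}

// Model of zext(<8 x i1>) to <8 x i8>: each set bit becomes the value 1.
std::array<std::uint8_t, 8> zextBoolVector(std::uint8_t mask) {
  std::array<std::uint8_t, 8> lanes{};
  for (int i = 0; i < 8; ++i)
    lanes[i] = static_cast<std::uint8_t>((mask >> i) & 1);
  return lanes;
}
```

The whole mask arrives through a single scalar load, and the per-lane work is a uniform shift-and-test that maps naturally onto vector operations rather than eight separate bit loads.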
shells/oh-my-posh: update to 29.17.0
v29.17.0
Bug Fixes
fish: use portable stty and fix cursor var assignment (fe6b0bd)
git: update no_commits_icon default to Nerd Fonts v3 glyph (513f32e), closes #7554
Features
fish: support cursor positioning (a696a08)
project: add Lean 4 lakefile project name detection (cb5300e)
v29.16.0
Features
aws: expose all shared config settings via .Settings map (42f4772)
spotify: switch Windows and WSL to SMTC-based detection (40ce7a3)
ffmpeg6 ffplay6: updated to 6.1.6
6.1.6:
configure: do not use interval regexp operators with awk
configure: fix symbol prefix detection
configure: Disable linker warnings for common symbols
avformat/av1dec: Check bits left before get_leb128()
avformat/dhav: Check amount read
avformat/mov: ensure required number of bytes is read
avcodec/cbs_sei: Always zero-initialize SEI payload
avformat/nuv: return early if header id fails to read
avformat/mpegts: Don't use uninitialized value in av_log()
avformat/hls: update current segment reference before use
avcodec/parser: Reset *buf_size on realloc failure
avfilter/af_stereotools: round-up max size of buffer
avcodec/hevc/hevcdec: Clean sao_pixel_buffer_v on allocation
avcodec/hevc/hevcdec: initialize qp_y_tab
swscale/alphablend: don't overread alpha plane on subsampled odd size
avformat/mov: fix crash when trying to get a fragment time for a non-existing fragment
[96 lines not shown]
ffmpeg7 ffplay7: updated to 7.1.5
7.1.5:
tests/checkasm/sw_rgb: don't write random data past the end of the buffer
swscale/alphablend: don't overread alpha plane on subsampled odd size
avcodec/vp8: Maintain consistency of frame pointers
avcodec/jpeg2000dec: clear array length when freeing it
av1dec: Don't crash if decoding of some frames have failed
avcodec/tdsc: propagate max_pixels to the JPEG tile decoder
avcodec/imm5: propagate max_pixels to the H264/HEVC sub-decoders
avcodec/cri: propagate max_pixels to the JPEG tile decoder
swscale: Fix out-of-bounds write errors in yuv2rgb_lasx.c file.
update for 7.1.5
avformat/mov: fix crash when stsz_sample_size is zero and sample_sizes is null
avformat/dashdec: Fail with any inner stream count being 0
swscale/x86/rgb_2_rgb: fix uyvytoyuv422 overwrite on odd width
avcodec/snowenc: fix SIGFPE in get_dc() when a block lies outside the plane
avcodec/snowenc: fix out-of-bounds OBMC read in get_dc() for narrow planes
avcodec/snowenc: fix out-of-bounds memcpy in get_block_rd() for narrow planes
[129 lines not shown]
net/samba416: Fix build with newer Python
PyEval_CallObjectWithKeywords() has been deprecated since Python 3.9 and
was removed in Python 3.13. Add an upstream patch that replaces it with
PyObject_Call() and bump the upper Python version limit to 3.14.
Reviewed by: kiwi
Differential Revision: https://reviews.freebsd.org/D57713
[orc-rt] Tidy up some SPS tag types. NFC. (#205038)
Replaces class definitions with decls for tag types that don't need a
body, and moves the SPSError tag down to just above its
serialization-traits class.