[OpenMP][OMPIRBuilder] Hoist static parallel region allocas to the entry block on the CPU
Follow-up on #171597, this PR hoists allocas in a parallel region to the
entry block of its corresponding outlined function. This PR does this
for the CPU while #171597 introduced the main mechanism to do so and did
it for the GPU.
[OpenMP][MLIR] Hoist static `alloca`s emitted by private `init` regions to the allocation IP of the construct (#171597)
Having more than 1 descritpr (allocatable or array) on the same
`private` clause triggers a runtime crash on GPUs at the moment.
For SPMD kernels, the issue happens because the initialization logic
includes:
* Allocating a number of temporary structs (these are emitted by flang
when `fir` is lowered to `mlir.llvm`).
* There is a conditional branch that determines whether we will allocate
storage for the descriptor and initialize array bounds from the original
descriptor or whether we will initialize the private descriptor to null.
Because of these 2 things, temp allocations needed for descriptors
beyond the 1st one are preceded by branching which causes the observed
the runtime crash.
This PR solves this issue by hoisting these static `alloca`s
instructions to the suitable allca IP of the parent construct.
One lock to rule them all.
Break the "lock" part of lfs_seglock() into its own function,
lfs_prelock(). Remove the lock flag SEGM_PROT, replacing instances of
lfs_seglock(fs, SEGM_PROT) with lfs_prelock(fs, 0). Reimplement the
fragment lock and cleaner lock to use lfs_prelock().
Avoids an observed deadlock between fragment extension and segment writing.
[InstCombine] Fold redundant FP clamp selects; relax min-max-pattern bailout in visitFCmp (#173452)
visitFCmp() previously bailed out when a following select matched a
clamp pattern. This blocks simplifications when the clamp is provably
redundant.
This PR allows simplification for clamp selects of flavor SPF_FMAXNUM/
SPF_FMINNUM when one arm is a constant and the other is a sitofp/uitofp
of an integer value, and the constant equals the exact min/max of that
integer domain:
* SPF_FMAXNUM (pattern max(X,C)): redundant if C is the minimum integer
mapped exactly to FP (e.g. X = sitofp i8, C = -128.0f).
* SPF_FMINNUM (pattern min(X,C)): redundant if C is the maximum integer
mapped exactly to FP (e.g. X = uitofp i8, C = 255.0f).
This fixes a regression in #173454
---------
Co-authored-by: Copilot <175728472+Copilot at users.noreply.github.com>
Co-authored-by: Yingwei Zheng <dtcxzyw at qq.com>
opencv*: update to 4.13
New Year update for OpenCV 4.x has been released.
Core module:
Modified Input/OutputArray methods to handle 'std::vector' or 'std::vector<std::vector>' in more accurate way #28242
Made cuda::GpuMatND compatible with InputArray/OutputArray #23913
Forced output type for empty matrices where it's defined in API #27972
Added std::vector length check Input/OutputArray #27817
Added 16-bit LUT and corresponding HAL entrypoint #27890, #27911
Add cv::Mat::copyAt for for ROI operation #27318
Extended JSON support in cv::FileStorage: null parsing #27579 and
Added support parsing null in JSON parser in cv::FileStorage #27579 and back slash "" support #27587
Fixed cv::solveCubic numerical instability via coefficient normalization #28117
Fixed tempfile race condition on Windows #28087
Restore parallel framework name on failure attempt #27802
Dropped OPENCV_FOR_OPENMP_DYNAMIC_DISABLE environment variable in favor of standard OMP_DYNAMIC #28122
Enabled fp16 conversions, but disabled NEON FP16 arithmetics on Windows for ARM #27897
Fixed dot product accumulation causing NORM test failures on Windows ARM64 #28211
[181 lines not shown]
ufshci: Enable admin command retry
Also, This patch fixes an issue where the admin command flag was not
passed through.
Sponsored by: Samsung Electronic
Reviewed by: imp
Differential Revision: https://reviews.freebsd.org/D54244
misc/fq: the port had been updated to the latest version 0.16.0
This release includes jq language fixes, a new safetensors decoder,
and minor improvements and fixes to MP4, AVC, and FLAC decoders.
Go 1.23 and later require dependency packages to be explicitly listed
in `vendor/modules.txt' to be imported; populating GH_TUPLE alone is
not enough. Unfortunately, I could not find a robust way to generate
this file during the build in offline mode and must resort to keeping
its static copy for now.
Notified by: upstream (the author), portscout
[Clang][Diagnostics] Mention 'import std' in typeid diagnostic (#173236)
Previously, the diagnostic only suggested including `<typeinfo>`. Since
C++20,the standard library may also be made available via `import std;`.
This change updates the diagnostic to mention `import std` as an
alternative and adds a test to cover the new wording.
kiwix-desktop: update to 2.5.0.
kiwix-destkop 2.5.0
===================
General:
* FIX: Bad english in download error message (@theVeryPulse #1331)
* FIX: Wrong app icon on Linux with Wayland (@Integral-Tech #1384)
* FIX: Allow to directly access ZIM file (ready-only) for Linux flatpak (@kelson42 #800)
* FIX: Avoid fulltext crash scenarios (@vighnesh-sawant #1241)
* NEW: Hidde default Kiwix Server port 80 in URL (@vighnesh-sawant #1417)
* NEW: Introduce "copy link" in context menu (@Bad-ptr #1422)
* NEW: Introduce Parrallels shared folder support (@kelson42 #1313)
* NEW: Tab shortcuts CTRL+PGUP and CTRL+PGDOWN (@kelson42 #1301)
* NEW: Deactivate menu entries when not actionable (@etude11 #1337)
* NEW: Speed control option for Text-to-Speech module (@heropj #1317)
Search(es):
* FIX: Single word suggestion query followed by a space query (@veloman-yunkan #953)
[23 lines not shown]
ncurses: update to 6.6.
20251230 6.6 release for upload to ftp.gnu.org
+ update announcement
+ corrected an ifdef needed for mouse support in MinGW/Windows
+ eliminate remaining duplicate code between MinGW/Windows drivers
20251227
+ make win32_curses.h obsolete in favor of nc_win32.h
+ modify MinGW32 configuration to account for its use of Windows-style
pathnames in filesystem checks.
+ replace --enable-exp-win32 option with --enable-named-pipes
20251220
> in-progress work to merge MinGW/Windows port.
+ eliminate EXP_WIN32_DRIVER with USE_NAMED_PIPES
+ change MS_TERMINAL to DEFAULT_TERM_VAR
20251213
[551 lines not shown]
[RISCV][SelectionDAG] Add a ISD::CTLS node for count leading redundant sign bits. Use it to select CLS(W). (#173417)
The RISC-V P extension adds an instruction equivalent to
__builtin_clrsb. AArch64 has a similar instruction that we currently fail to
select when using the builtin.
This patch adds a combine based on the canonical version of the pattern
emitted by clang for the builtin, (add (ctlz (xor x, (sra x, bw-1)))),
-1). I'm starting the combine at the ctlz because the outer add can
easily be combined into other nodes obscuring the full pattern. So we
generate (add (ctls x), 1) and hope the add will be combined away.
I've also added a combine for the pattern AArch64 recognizes
(ctlz_zero_undef (or (shl (xor x, (sra x, bw-1)), 1), 1)).
I've only enabled the combines when the target has a Legal or Custom
action for the operation, taking into account type promotion. We
can relax this in the future by adding a default expansion to
LegalizeDAG and adding more type legalization rules.
libheif: update to 1.21.1.
Workaround for c++17 stopped working, now really needs c++20 (like
it said for a couple releases already).
Changes:
1.21.1
This patch release only fixes a build error with some GCC versions because of a missing #include.
If you didn't get any build error, you do not need this updated patch release.
1.21.0
This release adds full support for reading and writing HEIF image sequences.
libheif will now encode HEIF image sequences with all included codecs.
Since HEIF image sequences are very similar to MP4 videos, this new version is also capable of decoding most MP4 videos (without audio, of course).
heif-enc documentation for sequence encoding
[13 lines not shown]