[libclc] Enable LLVM_RUNTIME_TARGETS in build system (#189892)
libclc target is now passed in from LLVM_RUNTIME_TARGETS.
The old configure flow based on `-DLLVM_ENABLE_RUNTIMES=libclc` is
deprecated because libclc no longer has a default target.
`-DLLVM_ENABLE_RUNTIMES=libclc -DLLVM_RUNTIME_TARGETS="<target-triple>"`
still works but it is considered legacy.
The new standard build requires each target to be selected explicitly
on the CMake command line through the runtimes target-specific cache
entry and LLVM_RUNTIME_TARGETS.
For example:
-DRUNTIMES_amdgcn-amd-amdhsa-llvm_LLVM_ENABLE_RUNTIMES=libclc
-DLLVM_RUNTIME_TARGETS="amdgcn-amd-amdhsa-llvm"
-DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=libclc
-DLLVM_RUNTIME_TARGETS="nvptx64-nvidia-cuda"
-DRUNTIMES_clspv--_LLVM_ENABLE_RUNTIMES=libclc
[17 lines not shown]
[LSR] Use TTI to check if zero-start IV is free in getSetupCost (#190587)
This avoids a downstream regression where LSR prefers {-1,+1}.
When a constant zero typically does not require preheader initialization
(queried via TTI::getIntImmCost), treat it as free in getSetupCost.
Three test changes are improvements: amx-across-func.ll,
2011-11-29-postincphi.ll and pr62660-normalization-failure.ll.
Other test changes are neutral.
[libclc] Refine generic __clc_get_sub_group_size with fast full sub-group path (#188895)
Add a fast path for the common case where the total work-group size is a
multiple of the max sub-group size.
The fallback path is ported from amdgpu/workitem/clc_get_sub_group_size.cl.
The compiler can generate predicated instructions for the fallback path to
avoid branches.
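The two paths described above can be sketched in plain C. This is only an illustration under stated assumptions, not the libclc source: `sub_group_size`, `wg_size`, `max_sg_size` and `linear_id` are hypothetical names, and the fallback assumes work-items are assigned to sub-groups in linear-id order.

```c
#include <stdint.h>

/* Hypothetical helper, not the libclc implementation.
 * wg_size:     total number of work-items in the work-group
 * max_sg_size: maximum sub-group size (must be non-zero)
 * linear_id:   linear local id of the calling work-item (< wg_size) */
static uint32_t sub_group_size(uint32_t wg_size, uint32_t max_sg_size,
                               uint32_t linear_id)
{
    /* Fast path: the work-group divides evenly, so every sub-group is full. */
    if (wg_size % max_sg_size == 0)
        return max_sg_size;

    /* Fallback: find the first work-item of the caller's sub-group and
     * clamp the number of remaining work-items to the maximum size. */
    uint32_t sg_start = linear_id - (linear_id % max_sg_size);
    uint32_t remaining = wg_size - sg_start;
    return remaining < max_sg_size ? remaining : max_sg_size;
}
```

With a work-group of 100 items and a maximum sub-group size of 32, only the last sub-group (linear ids 96 to 99) is partial, and the final ternary is the kind of select a compiler can lower without a branch.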
crypto/openssl: update from 3.0.16 to 3.0.20
### Description
This particular change contains all functional and security fixes made
between 3.0.16 and 3.0.20, with the net-resulting security issues being
the ones addressed in 3.0.20 [1] [2].
This is a direct commit to stable/14 as stable/15 and later use OpenSSL
3.5, as of writing, and thus this change could not have been merged
through main and stable/15 since 3.0.17 and later were released after
:main switched to OpenSSL 3.5.
This does not contain any merge commits as any MFV changes are banned
unless they go through :main.
### Process used when updating the component
1. Run the subtree merge command in a staging tree.
[12 lines not shown]
drm/amd/display: Fix DCE LVDS handling
From Alex Deucher
24ce568f7553a55404cddf8f0fca2a7c0ea9451c in linux-6.18.y/6.18.22
90d239cc53723c1a3f89ce08eac17bf3a9e9f2d4 in mainline linux
drm/amd/pm: disable OD_FAN_CURVE if temp or pwm range invalid for smu v13
From Yang Wang
ba9be472a438ea348f0fffaaebf541960ee643db in linux-6.18.y/6.18.22
3e6dd28a11083e83e11a284d99fcc9eb748c321c in mainline linux
drm/amdgpu/pm: drop SMU driver if version not matched messages
From Alex Deucher
2b8ae82d85d75347984297ed60d5005622992f6b in linux-6.18.y/6.18.22
a3ffaa5b397f4df9d6ac16b10583e9df8e6fa471 in mainline linux
drm/amdgpu: Change AMDGPU_VA_RESERVED_TRAP_SIZE to 64KB
From Donet Tom
77c918eaa4c916751769242567407f61c6af142a in linux-6.18.y/6.18.22
4487571ef17a30d274600b3bd6965f497a881299 in mainline linux
drm/amdgpu: validate doorbell_offset in user queue creation
From Junrui Luo
3543005a42d7e8e12b21897ef6798541bf7cbcd3 in linux-6.18.y/6.18.22
a018d1819f158991b7308e4f74609c6c029b670c in mainline linux
drm/amdgpu: Fix wait after reset sequence in S4
From Lijo Lazar
1e130cf9cd172c684ff471d9cd8bccf06db64d88 in linux-6.18.y/6.18.22
daf470b8882b6f7f53cbfe9ec2b93a1b21528cdc in mainline linux
drm/i915/dp: Use crtc_state->enhanced_framing properly on ivb/hsw CPU eDP
From Ville Syrjala
d0fd0bea847526dbe3d862c01e98a1617538565d in linux-6.18.y/6.18.22
9c9a57e4e337f94e23ddf69263fd0685c91155fb in mainline linux
drm/i915/dsi: Don't do DSC horizontal timing adjustments in command mode
From Ville Syrjala
86e926b108880c0109b8635e459450447156aeb7 in linux-6.18.y/6.18.22
4dfce79e098915d8e5fc2b9e1d980bc3251dd32c in mainline linux
drm/amd/display: Fix NULL pointer dereference in dcn401_init_hw()
From Srinivasan Shanmugam
10c13c111d0d7f8e101c742feff264fc98e3f9f7 in linux-6.18.y/6.18.22
e927b36ae18b66b49219eaa9f46edc7b4fdbb25e in mainline linux
drm/ioc32: stop speculation on the drm_compat_ioctl path
From Greg Kroah-Hartman
f0e441be08a2eab10b2d06fccfa267ee599dd6b3 in linux-6.18.y/6.18.22
f8995c2df519f382525ca4bc90553ad2ec611067 in mainline linux
Revert "drm: Fix use-after-free on framebuffers and property blobs when calling drm_dev_unplug"
From Maarten Lankhorst
89c3a4ba60309cc6333a8f3940a1866fc21a9000 in linux-6.18.y/6.18.22
45ebe43ea00d6b9f5b3e0db9c35b8ca2a96b7e70 in mainline linux
pfr_pool_get() in the call to pfr_prepare_network() must use af
instead of the hardcoded AF_INET
Issues reported and patch submitted by:
Renaud Allard <renaud () allard ! it>
OK sashan@
pf_frag_compare() should not be using subtraction to compare
fragment IDs
Issues reported and patch submitted by:
Renaud Allard <renaud () allard ! it>
OK sashan@
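To illustrate why subtraction is a poor comparator for fragment IDs, here is a minimal sketch. It assumes the IDs are 32-bit unsigned values; the function names are hypothetical and this is not the pf source.

```c
#include <stdint.h>

/* Fragile pattern: the unsigned difference is converted to int, so two
 * IDs that are 2^31 or more apart produce a result with the wrong sign
 * on common two's-complement platforms. */
static int id_cmp_subtract(uint32_t a, uint32_t b)
{
    return (int)(a - b);
}

/* Robust pattern: compare explicitly and return -1, 0 or 1. */
static int id_cmp_safe(uint32_t a, uint32_t b)
{
    return (a > b) - (a < b);
}
```

A comparator with inconsistent signs gives no valid total order, which matters when the result is used to sort or to key a tree.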
Fix error handling in pf_sourcelim_add()
There are two issues in pf_sourcelim_add() error path:
- stale pointers are left in pf_sourcelim_id_tree_inactive when
duplicate source limiter instance is found in pf_sourcelim_nm_tree
- overload table may leak when insertion of new source limiter instance
fails.
Both issues reported and patch submitted by:
Renaud Allard <renaud () allard ! it>
OK sashan@
gpart: More nuance for GPT support
A careful reading of the GPT standard shows that one may have fewer than
128 entries in the GPT table. While the standard requires that we
reserve enough space (32 512-byte-LBAs or 4 4096-byte-LBAs), it also
explicitly allows one to specify fewer actual partitions (since that
controls what is in the CRC). It requires the first usable LBA to be 34
(512 sectors) or 6 (4k sectors) or larger. That requirement is not
enforced (it's not listed as one of the validation criteria for the GPT).
We should likely do so in the future.
To that end, allow the default number of entries used on creation
(defent) to differ from (be larger than) the minimum legal number of
entries (minent). For gpt, these numbers work out to 128 and 1
respectively. For all the others, make minent == defent so this is a nop
for those partitioning schemes.
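The reserved-space arithmetic above can be checked with a short sketch. The names below are hypothetical and do not come from gpart. It assumes 128-byte entries and space reserved for 128 of them (16384 bytes), with LBA 0 holding the protective MBR and LBA 1 the GPT header.

```c
#include <stdint.h>

#define GPT_ENTRY_SIZE   128u  /* bytes per partition entry (assumed) */
#define GPT_RESERVED_ENT 128u  /* entries the array must have room for */

/* Number of sectors needed to hold the reserved partition entry array. */
static uint64_t gpt_array_sectors(uint32_t sector_size)
{
    uint64_t bytes = (uint64_t)GPT_ENTRY_SIZE * GPT_RESERVED_ENT; /* 16384 */
    return (bytes + sector_size - 1) / sector_size;  /* round up */
}

/* Lowest LBA available for partitions: protective MBR (LBA 0) and GPT
 * header (LBA 1) come first, followed by the entry array. */
static uint64_t gpt_min_first_lba(uint32_t sector_size)
{
    return 2 + gpt_array_sectors(sector_size);
}
```

For 512-byte sectors this gives 32 array sectors and a lowest start of 2 + 32 = 34. For 4096-byte sectors it gives 4 array sectors and a lowest start of 6.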
Sponsored by: Netflix
Reviewed by: zlei, emaste
[3 lines not shown]
gpart: Add warning when the start sector is too low.
Add a warning if the starting sector is too low. The standard requires
that at least 16k is reserved for the GPT Partition Array, but some
tools produce GPT images with fewer than the required number of reserved
sectors.
PR: 274312
Sponsored by: Netflix
Differential Revision: https://reviews.freebsd.org/D42247
(cherry picked from commit 2cbda736cea8f82cfc5caab0f6099f0fbfe28537)