OS-8724 - Reserve vendor-data for SmartOS/Triton use
Reviewed by: Dan McDonald <danmcd at edgecast.io>
Reviewed by: Nahum Shalman <nahum.shalman at edgecast.io>
Approved by: Dan McDonald <danmcd at edgecast.io>
devel/hp48xgcc: try to fix a race condition during the build
Declare as(1) and ld(1) targets' dependency on the `libLKV.a'
and use it literally instead of the phony `lib' target.
Reported by: pkg-fallout
[SPIRV] Enforce integer return types for GetDimensions intrinsics
Update the `GetDimensions` intrinsics in the SPIR-V backend to strictly
return integer types (i32 or vectors of i32), aligning with the HLSL
design document for GetDimensions lowering. The intrinsics are no longer
overloaded on the return type.
As a result, instruction selection in `SPIRVInstructionSelector.cpp`
has been simplified by removing the logic that handles floating-point
returns and generates `OpConvertUToF` instructions. Floating-point
conversions are now expected to be handled by Clang codegen.
Updates:
- IntrinsicsSPIRV.td: Replaced `llvm_any_ty` with explicit integer
and integer vector types (`llvm_i32_ty`, `llvm_v2i32_ty`, etc.).
- SPIRVInstructionSelector.cpp: Removed floating-point type checks and
`OpConvertUToF` handling for GetDimensions.
- GetDimensions.ll: Updated to use the correct integer-only intrinsic
names without return type suffixes.
[2 lines not shown]
[SPIRV] Implement the int_spv_resource_calculate_lod* IntrinsicsSPIRV (#188337)
Implements intrinsics used to get the level-of-detail given a texture,
sampler, and a coordinate. It will be used to implement the
corresponding HLSL methods.
Assisted-by: Gemini
unbound: limit duckdb to a single thread in write mode to reduce logger memory usage
After some testing, it seems the duckdb python API has a tendency to
leak (up to a cap), which seems to be mitigated when we limit to a single thread.
Further testing shows that the single logger.py process doesn't
use parallelization when appending dataframes to the db instance
and as such multiple threads are useless to begin with. The heavier
actions are the read actions, which are separate short-lived processes
and do not suffer from the same issue and likely do require duckdb
parallelism to perform properly, so apply the single thread only to
writers.
Treat `''` and `""` in Gnu-tokenized response files as an empty string argument. (#187566)
This matches the behavior of gcc and also fixes an inconsistency with
the way the same arguments would be parsed by the shell. For example,
this command line passed directly on the shell:
`command -option1 '' -option2`
has three arguments: `"-option1"`, `""`, and `"-option2"`. However, if
these are passed in a Gnu-tokenized response file today:
```
-option1
''
-option2
```
The `''` is discarded and the command incorrectly has two arguments:
`"-option1"` and `"-option2"`.
[VectorCombine] Fix transitive Uses in foldShuffleToIdentity (#188989)
The Uses in foldShuffleToIdentity is intended to detect where an operand
is used to distinguish between splats, identities and concats of the
same value. When looking through multiple unsimplified shuffles the same
Use could be both a splat and a identity though. This patch changes the
Use to a Value and an original Use, so that even if we are looking
through multiple vectors we recognise the splat vs identity vs concat of
each use correctly.
Fixes #180338
NAS-139738 / 27.0.0-BETA.1 / Accept the new cert organization name in `renew_certs` (#18604)
Follows the organization name change in our self-signed certs. Accept
the old name "iXsystems" for existing certs, then the new name
"iXsystems Inc. dba TrueNAS" once they are renewed.
Also replace an old example email in the API docs.
Tests
http://jenkins.eng.ixsystems.net:8080/job/tests/job/api_tests/8283/
SROA: Fix tree merge IRBuilder insert point (#189680)
StoreInfos is sorted by slice offset, not program order. Anchoring the
IRBuilder at StoreInfos.back() could emit shufflevectors before SSA
values defined later in the same block (invalid IR).
Insert merged shuffles immediately before TheLoad when the load shares
the store block. When the load is elsewhere, insert before the store
block terminator so the merge runs after every store + any trailing
instructions in that block.
[LLVM][Intrinsics] Refactor IIT encoding generation (#189790)
Refactor IIT encoding generation. The core change here is that when
generating IIT encodings, we pre-generate all the bits of the IIT
encoding except cases where a type needs to encode its own overload
index, which is patched in later in `TypeInfoGen`. In addition, this
change introduces a class hierarchy for dependent types, so that the
checks in `TypeInfoGen` are more meaningful, and renames/simplifies
several other pieces of code, as listed below.
1. Change the encoding for IIT_ARG's ArgInfo byte to encode the overload
slot index in lower 5 bits and the argument kind in upper 3 bits. This
enabled generating the same packed format for all other dependent types
that need to encode an overload slot index in the IIT encoding. Adjusted
the corresponding C++ code in `IITDescriptor::getArgumentNumber` and
`IIT_Descriptor::getArgumentKind`.
2. Introduce more descriptive classes to handle packing of the overload
index + arg kind into the IIT encoding. `OverloadIndexPlaceholder` is
used to generate a transient value in the type-signature that is patched
[24 lines not shown]
Retry "[SDAG] (abs (add nsw a, -b)) -> (abds a, b) (#175801)" (#186659)
A better version of #175801 . see that for more info.
Fixes #185467
The original patch was checking the correctness of the transformation
based on the original Op1 , which was then negated (in the case of
IsAdd). This patch fixes that issue by inverting the sign bit in that
case.
Also pushed a slight nfc there to simplify the code and remove some
duplication.
alive2 proofs:
abds: https://alive2.llvm.org/ce/z/oJQPss
abdu: https://alive2.llvm.org/ce/z/HfPF5q
[2 lines not shown]
[X86][DAG] remove LowerFCanonicalize (#188127)
Remove LowerFCanonicalize. Added fallback for cases when the scalar type also has its Custom lowering to avoid regressions on AMDGPU and SystemZ.
Fixes #143862
[mlir][memref] Fold memref.reinterpret_cast operations with valid offset or size constants. (#189533)
When encountering an invalid offset or size, we only skip the current
invalid value and continue attempting to fold other valid offsets or
sizes.
[HLSL] Add GetDimensions to Texture2D.
This commit add the GetDimensions mehtods to Texture2D. For DXIL, it
requires intrinsics that are not yet available. They are added, but not
implemented.
Assisted-by: Gemini
[HLSL] Add CalculateLevelOfDetail methods to Texture2D
This adds the CalculateLevelOfDetail and CalculateLevelOfDetailUnclamped
methods to Texture2D using the establish pattern used for other methods.
Assisted-by: Gemini