[lldb][Process/FreeBSDKernelCore] Remove interactive mode check (#187981)
`debugger.GetCommandInterpreter().IsInteractive()` doesn't necessarily
mean that the debugger is running without `-b` or `--batch` flag. This
caused an issue where the message isn't printed in any case. Remove the
check so the message is always printed for now.
Fixes: 9d9c7fc00bdddd72e78b19e9fbb6af9d07c2218b (#178027)
Signed-off-by: Minsoo Choo <minsoochoo0122 at proton.me>
[AMDGPU][SIInsertWaitcnts] Fix counterOutOfOrder() check for LOAD_CNT (#187758)
This is a follow-up to @piotrAMD's comment in #178511 .
`hasPendingEvent()` used to return an unsigned mask but this changed in
#178511 when this function started to return bool. This changed the
functionality of `counterOutOfOrder()` which was still using an unsigned
mask that was erroneously being assigned a boolean instead of the
expected mask.
[flang][NFC] Converted five tests from old lowering to new lowering (part 38) (#188151)
Tests converted from test/Lower/Intrinsics: pack.f90, parity.f90,
popcnt.f90, poppar.f90, present.f90
AMDGPU: Codegen for v_dual_dot2acc_f32_f16/bf16 from VOP3
For V_DOT2_F32_F16 and V_DOT2_F32_BF16 add pre-RA register allocation
hints to preferably assign dst and src2 to the same physical register.
When the hint is satisfied, canMapVOP3PToVOPD recognises the instruction
as eligible for VOPD pairing by checking if it is VOP2 like:
dst==src2, no source modifiers, no clamp, and src1 is a register.
Mark both instructions as commutable to allow a literal in src1 to be
moved to src0, since VOPD only permits a literal in src0.
[clangd] Add background index path remapping tool (#185482)
Introduce a standalone tool to remap paths inside clangd background
index files. This allows users to pay the cost of generating a full
background index once, then reuse the background index across multiple
clients. Each client rewrites the background index in place after
copying the original, specifying the path(s) to be remapped.
This is an alternative approach to the dynamic path remapping proposed
in #180285.
Fixes clangd/clangd#847
Assisted-by: claude
AMDGPU: Implememt memsize forms of isLoadFromStackSlot/isStoreToStackSlot
Requested in #182673, though I'm not sure why this needs to be pushed
into targets. The size can be taken from the machine mem operand generically.
AMDGPU/GlobalISel: RegBankLegalize rules for cube intrinsics (#187762)
Add RegBankLegalize rules for cube* intrinsics. Also update manual tests
to use autogenerated checks.
[X86] getFauxShuffleMask - decode insert_subvector(undef, x, c) as shuffle if c != 0 (#188258)
Move existing combineX86ShufflesRecursively handling into
getFauxShuffleMask to allow other shuffle folds to make use of it