LLVM/project f4fb73allvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime' into users/meinersbur/flang_runtime_shared
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8514,704 files

LLVM/project bed5b1fllvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime_move-files' into users/meinersbur/flang_runtime
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8514,704 files

LLVM/project a8c87d7llvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime_flang_rt' into users/meinersbur/flang_runtime_move-files
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8514,704 files

LLVM/project 1f3d71ellvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime_FLANG_INCLUDE_RUNTIME' into users/meinersbur/flang_runtime_flang_rt
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8514,704 files

LLVM/project b68700bllvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime_Testing' into users/meinersbur/flang_runtime_FLANG_INCLUDE_RUNTIME
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8514,704 files

LLVM/project f007594llvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime_FortranSupport' into users/meinersbur/flang_runtime_Testing
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8514,704 files

LLVM/project a519f98llvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'users/meinersbur/flang_runtime_FortranDecimal' into users/meinersbur/flang_runtime_FortranSupport
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,698 files not shown
+229,000-108,8504,704 files

LLVM/project f2abcdellvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Merge branch 'main' into users/meinersbur/flang_runtime_FortranDecimal
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+233-6,579llvm/test/DebugInfo/NVPTX/debug-info.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+19,871-21,1214,699 files not shown
+229,007-108,8504,705 files

LLVM/project df1b617clang/include/clang/Basic BuiltinsX86.td, llvm/test/CodeGen/AMDGPU dagcombine-fmul-sel.ll

Merge branch 'main' into users/zhaoqi5/pre-commit-tlsle-mergebaseoffset-tests
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+5,378-4clang/include/clang/Basic/BuiltinsX86.td
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+3,130-1,856llvm/test/CodeGen/AMDGPU/dagcombine-fmul-sel.ll
+25,717-13,9925,312 files not shown
+294,216-135,0845,318 files

LLVM/project 81234f6clang-tools-extra/clangd/unittests HeuristicResolverTests.cpp, lldb/unittests/SymbolFile/DWARF DWARFDIETest.cpp

rebase

Created using spr 1.3.4
DeltaFile
+1,610-1,545llvm/test/DebugInfo/NVPTX/debug-info.ll
+876-875llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
+682-644llvm/test/Transforms/LoopVectorize/RISCV/interleaved-accesses.ll
+640-0lldb/unittests/SymbolFile/DWARF/DWARFDIETest.cpp
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+0-542clang-tools-extra/clangd/unittests/HeuristicResolverTests.cpp
+4,439-3,606663 files not shown
+19,930-9,451669 files

LLVM/project 239bae2clang-tools-extra/clangd/unittests HeuristicResolverTests.cpp, lldb/unittests/SymbolFile/DWARF DWARFDIETest.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+1,610-1,545llvm/test/DebugInfo/NVPTX/debug-info.ll
+876-875llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
+682-644llvm/test/Transforms/LoopVectorize/RISCV/interleaved-accesses.ll
+640-0lldb/unittests/SymbolFile/DWARF/DWARFDIETest.cpp
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+0-542clang-tools-extra/clangd/unittests/HeuristicResolverTests.cpp
+4,439-3,606663 files not shown
+19,930-9,451669 files

LLVM/project 11cc681clang-tools-extra/clangd/unittests HeuristicResolverTests.cpp, lldb/unittests/SymbolFile/DWARF DWARFDIETest.cpp

update

Created using spr 1.3.4
DeltaFile
+1,610-1,545llvm/test/DebugInfo/NVPTX/debug-info.ll
+876-875llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
+682-644llvm/test/Transforms/LoopVectorize/RISCV/interleaved-accesses.ll
+640-0lldb/unittests/SymbolFile/DWARF/DWARFDIETest.cpp
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+0-542clang-tools-extra/clangd/unittests/HeuristicResolverTests.cpp
+4,439-3,606664 files not shown
+19,948-9,451670 files

LLVM/project 9dcd3edclang-tools-extra/clangd/unittests HeuristicResolverTests.cpp, lldb/unittests/SymbolFile/DWARF DWARFDIETest.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.4

[skip ci]
DeltaFile
+1,610-1,545llvm/test/DebugInfo/NVPTX/debug-info.ll
+876-875llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
+682-644llvm/test/Transforms/LoopVectorize/RISCV/interleaved-accesses.ll
+640-0lldb/unittests/SymbolFile/DWARF/DWARFDIETest.cpp
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+0-542clang-tools-extra/clangd/unittests/HeuristicResolverTests.cpp
+4,439-3,606664 files not shown
+19,934-9,448670 files

LLVM/project eab9b11llvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Rebase, address comments

Created using spr 1.3.5
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+2,263-2,208llvm/test/MC/Disassembler/AMDGPU/gfx12_dasm_vopc.txt
+21,901-16,7503,291 files not shown
+182,635-84,8913,297 files

LLVM/project f1a1be0llvm/test/CodeGen/RISCV/rvv vfma-vp.ll, llvm/test/CodeGen/X86 vector-interleaved-load-i8-stride-8.ll

Rebase

Created using spr 1.3.5
DeltaFile
+4,839-5,345llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll
+3,056-4,201llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-8.ll
+6,647-0llvm/test/MC/AMDGPU/gfx12_asm_vop3c_dpp16-fake16.s
+2,667-2,586llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vopc.txt
+2,429-2,410llvm/test/MC/Disassembler/AMDGPU/gfx11_dasm_vop3_from_vopc.txt
+2,263-2,208llvm/test/MC/Disassembler/AMDGPU/gfx12_dasm_vopc.txt
+21,901-16,7504,167 files not shown
+210,098-98,5164,173 files

LLVM/project 9c7987fllvm/lib/Transforms/InstCombine InstCombineMulDivRem.cpp, llvm/test/Transforms/GlobalOpt resolve-fmv-ifunc.ll

Merge branch 'main' into users/rampitec/01-16-_amdgpu_add_test_for_valu_hoisiting_from_wwm_region._nfc
DeltaFile
+682-644llvm/test/Transforms/LoopVectorize/RISCV/interleaved-accesses.ll
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+365-0llvm/test/Transforms/GlobalOpt/resolve-fmv-ifunc.ll
+259-1llvm/test/Transforms/LoopVectorize/AArch64/sve-interleaved-accesses.ll
+252-0llvm/test/Transforms/LoopVectorize/AArch64/sve-interleaved-masked-accesses.ll
+175-0llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+2,364-64549 files not shown
+4,240-85755 files

LLVM/project 3b3590allvm/lib/Transforms/InstCombine InstCombineMulDivRem.cpp, llvm/test/Transforms/InstCombine fsqrtdiv-transform.ll

Revert "Revert "[InstCombine] Transform high latency, dependent FSQRT/FDIV into FMUL"" (#123313)

Reverts llvm/llvm-project#123289
DeltaFile
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+176-0llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+807-02 files

LLVM/project ba06cf8llvm/lib/Transforms/InstCombine InstCombineMulDivRem.cpp, llvm/test/Transforms/InstCombine fsqrtdiv-transform.ll

Revert "Revert "[InstCombine] Transform high latency, dependent FSQRT/FDIV in…"

This reverts commit 606d0a7cdc0c551df754eb4494a2c16861b6a9b9.
DeltaFile
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+176-0llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+807-02 files

LLVM/project 606d0a7llvm/lib/Transforms/InstCombine InstCombineMulDivRem.cpp, llvm/test/Transforms/InstCombine fsqrtdiv-transform.ll

Revert "[InstCombine] Transform high latency, dependent FSQRT/FDIV into FMUL" (#123289)

Reverts llvm/llvm-project#87474
DeltaFile
+0-631llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+0-176llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+0-8072 files

LLVM/project b2a7179llvm/lib/Transforms/InstCombine InstCombineMulDivRem.cpp, llvm/test/Transforms/InstCombine fsqrtdiv-transform.ll

Revert "[InstCombine] Transform high latency, dependent FSQRT/FDIV into FMUL …"

This reverts commit 7253c6fde498c4c9470b681df47d46e6930d6a02.
DeltaFile
+0-631llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+0-176llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+0-8072 files

LLVM/project 7253c6fllvm/lib/Transforms/InstCombine InstCombineMulDivRem.cpp, llvm/test/Transforms/InstCombine fsqrtdiv-transform.ll

[InstCombine] Transform high latency, dependent FSQRT/FDIV into FMUL (#87474)

The proposed patch, in general, tries to transform the below code
sequence:
x = 1.0 / sqrt (a);
r1 = x * x;  // same as 1.0 / a
r2 = a / sqrt(a); // same as sqrt (a)

TO

(If x, r1 and r2 are all used further in the code) 
r1 = 1.0 / a
r2 = sqrt (a)
x = r1 * r2

The transform tries to make high latency sqrt and div operations
independent and also saves on one multiplication.

The patch was tested with SPEC17 suite with cpu=neoverse-v2. The

    [6 lines not shown]
DeltaFile
+631-0llvm/test/Transforms/InstCombine/fsqrtdiv-transform.ll
+176-0llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+807-02 files