LLVM/project 741a6f2clang/lib/AST ItaniumMangle.cpp

Revert "[Clang][ItaniumMangle][NFC] Refactor FunctionTypeDepthState (#196240)"

This reverts commit a2643354db14aaf892519e98bf450c3fc3701dfe.
DeltaFile
+42-28clang/lib/AST/ItaniumMangle.cpp
+42-281 files

LLVM/project 3e20d3dllvm/lib/Analysis DependenceAnalysis.cpp

[DA] Delete early return in accumulateCoefficientsGCD (NFCI) (#197935)

This patch resolved one TODO comment in `accumulateCoefficientsGCD`
regarding an early return. I think this early return doesn't change the
final result because:

- The presence/absence of this early return can only affect whether
`CurLoopCoeff` is set.
- Regardless the value of `CurLoopCoeff`, if `RunningGCD` equals 1, the
result of caller side while loop doesn't change.

Deleting this early return is somewhat beneficial, because it allows us
to merge `analyzeCoefficientsForGCD` into this function.
DeltaFile
+0-5llvm/lib/Analysis/DependenceAnalysis.cpp
+0-51 files

LLVM/project 813f2abllvm/test/CodeGen/Thumb2 mve-clmul.ll, llvm/test/CodeGen/X86 vector-replicaton-i1-mask.ll

Merge branch 'main' into users/kasuga-fj/da-resolve-todo-acc-gcd
DeltaFile
+1,243-8,768llvm/test/CodeGen/X86/vector-replicaton-i1-mask.ll
+3,699-3,716llvm/test/CodeGen/Thumb2/mve-clmul.ll
+0-4,752llvm/test/tools/llvm-mca/RISCV/SiFiveP800/vlseg-vsseg.s
+4,549-0llvm/test/tools/llvm-mca/RISCV/SiFiveP800/rvv/arithmetic.test
+3,729-0llvm/test/tools/llvm-mca/RISCV/SiFiveP800/rvv/fp.test
+3,149-0llvm/test/tools/llvm-mca/RISCV/SiFiveP800/rvv/vlseg-vsseg.test
+16,369-17,236850 files not shown
+69,835-48,750856 files

LLVM/project bb2d0d6llvm/include/llvm/Analysis MemoryBuiltins.h, llvm/include/llvm/IR InstrTypes.h

[MemoryBuiltins][NFC] Allow users to retrieve detailed (de)allocation info

There are some helpers to inspect a value or call but not all
information about the (de)allocation are made available outside of
MemoryBuiltins.cpp. The two new functions allow users a more in-depth
view of (de)allocations through a single API. To help with this, we now
read the alloc_align attribute to provide better alignment information
to users. alloc-family is used as well. Two new helpers provide argument
numbers, rather than values.
DeltaFile
+97-33llvm/lib/Analysis/MemoryBuiltins.cpp
+42-0llvm/include/llvm/Analysis/MemoryBuiltins.h
+10-3llvm/lib/IR/Instructions.cpp
+4-0llvm/include/llvm/IR/InstrTypes.h
+153-364 files

LLVM/project 59fbb7dllvm/lib/Analysis MemoryBuiltins.cpp

[MemoryBuiltins] Consistently infer and use MallocFamily

MallocFamily (the enum and StringRef) are used alongside AllocFnsTy.
The latter is picked up from the tables while the former is encoded in
the IR. While they should be merged at some point (see TODO), this
commit makes sure we consistently initialize the MallocFamily String and
pass it to users.
DeltaFile
+28-12llvm/lib/Analysis/MemoryBuiltins.cpp
+28-121 files

LLVM/project 13442edllvm/include/llvm/Analysis MemoryBuiltins.h, llvm/include/llvm/IR InstrTypes.h

[MemoryBuiltins][NFC] Allow users to retrieve detailed (de)allocation info

There are some helpers to inspect a value or call but not all
information about the (de)allocation are made available outside of
MemoryBuiltins.cpp. The two new functions allow users a more in-depth
view of (de)allocations through a single API. To help with this, we now
read the alloc_align attribute to provide better alignment information
to users. alloc-family is used as well. Two new helpers provide argument
numbers, rather than values.
DeltaFile
+97-33llvm/lib/Analysis/MemoryBuiltins.cpp
+42-0llvm/include/llvm/Analysis/MemoryBuiltins.h
+10-3llvm/lib/IR/Instructions.cpp
+4-0llvm/include/llvm/IR/InstrTypes.h
+153-364 files

LLVM/project 8f740a3clang-tools-extra/clang-tidy/misc StaticInitializationCycleCheck.cpp, clang-tools-extra/test/clang-tidy/checkers/misc static-initialization-cycle.cpp

[clang-tidy] Fix crash in misc-static-initialization-cycle (#198155)

This commit fixes `misc-static-initialization-cycle` crashing on `catch
(...)`.

Catch-all handlers have no exception declaration, so traversal of
`CXXCatchStmt` can call `TraverseDecl(nullptr)`. The check previously
passed that null pointer to `DeclContext::containsDecl`. This commit
fixes the problem by adding a null guard.

Closes #198150
DeltaFile
+8-0clang-tools-extra/test/clang-tidy/checkers/misc/static-initialization-cycle.cpp
+1-1clang-tools-extra/clang-tidy/misc/StaticInitializationCycleCheck.cpp
+9-12 files

LLVM/project d191c2allvm/include/llvm/Analysis MemoryBuiltins.h, llvm/lib/Analysis MemoryBuiltins.cpp

[MemoryBuiltins][NFC] Clang format and fixed coding style
DeltaFile
+75-75llvm/lib/Analysis/MemoryBuiltins.cpp
+1-1llvm/include/llvm/Analysis/MemoryBuiltins.h
+76-762 files

LLVM/project e557242flang/lib/Semantics check-omp-structure.cpp check-omp-structure.h

[flang][OpenMP] Simplify checks for type-parameter inquiry

Remove the no longer needed IsDataRefTypeParamInquiry.
DeltaFile
+23-47flang/lib/Semantics/check-omp-structure.cpp
+2-1flang/lib/Semantics/check-omp-structure.h
+25-482 files

LLVM/project 3d3f4becompiler-rt/lib/sanitizer_common/tests sanitizer_stackdepot_test.cpp

[compiler-rt] Fix StackDepot benchmark thread barrier (#197633)

Use Param.Threads (number of worker threads) as barrier threshold
instead of Param.UniqueThreads (boolean that controls input generation).

This also silences
[-Wbool-integral-comparison](https://github.com/llvm/llvm-project/pull/194180)
warning I'm working on.
DeltaFile
+2-2compiler-rt/lib/sanitizer_common/tests/sanitizer_stackdepot_test.cpp
+2-21 files

LLVM/project 938211bllvm/lib/Transforms/Vectorize LoopVectorizationPlanner.cpp LoopVectorize.cpp

[LV] Move isMoreProfitable to LoopVectorizationPlanner.cpp (NFC). (#195269)

isMoreProfitable does not depend on anything in LoopVectorize.cpp, move
it to the recently added LoopVectorizationPlanner.cpp.

PR: https://github.com/llvm/llvm-project/pull/195269
DeltaFile
+90-0llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.cpp
+0-90llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+90-902 files

LLVM/project 6a14487llvm/lib/Target/AArch64 AArch64LowerHomogeneousPrologEpilog.cpp AArch64.h

[NewPM] Add newpm port for AArch64LowerHomogeneousPrologEpilog (#197606)
DeltaFile
+33-17llvm/lib/Target/AArch64/AArch64LowerHomogeneousPrologEpilog.cpp
+7-1llvm/lib/Target/AArch64/AArch64.h
+1-1llvm/lib/Target/AArch64/AArch64TargetMachine.cpp
+1-0llvm/lib/Target/AArch64/AArch64PassRegistry.def
+42-194 files

LLVM/project 3a1206allvm/test/Transforms/LoopVectorize epilog-iv-select-cmp.ll

[LV] Add find-last-iv test with blend (NFC). (#198214)

Add test cases showing functional change for
https://github.com/llvm/llvm-project/pull/194729.
DeltaFile
+78-2llvm/test/Transforms/LoopVectorize/epilog-iv-select-cmp.ll
+78-21 files

LLVM/project 5f2bedcclang/include/clang/AST ASTContext.h, clang/include/clang/Basic Builtins.td

Add clang warning if fp exception functions are called without appropriate flags/pragmas (#187860)

Fixes https://github.com/llvm/llvm-project/issues/128239

The implementation adds warnings for floating-point exception function
calls (fenv.h) made without enabling floating-point exception behavior
via `-ffp-exception-behavior=maytrap/strict` or `#pragma STDC
FENV_ACCESS ON`. To support recognition of all fenv.h builtins,
`fexcept_t` and `fenv_t` were added as builtin types.
DeltaFile
+68-0clang/test/Sema/fenv-access.c
+55-0clang/include/clang/Basic/Builtins.td
+51-0clang/test/Sema/builtin-fenv.c
+36-0clang/lib/Serialization/ASTReader.cpp
+34-1clang/include/clang/AST/ASTContext.h
+35-0clang/test/Sema/fenv-access-implicit.c
+279-114 files not shown
+424-220 files

LLVM/project c5c8e91llvm/lib/Transforms/Instrumentation IndirectCallPromotion.cpp

[ICP] Update comment about duplicate values in VP MD

With https://github.com/llvm/llvm-project/pull/196649 deduplicating VP
values and https://github.com/llvm/llvm-project/pull/193083 enforcing
this, we no longer need to worry about duplicate values, zero or
otherwise. Update the comment to reflect this.

Reviewers: mingmingl-llvm, teresajohnson

Reviewed By: mingmingl-llvm

Pull Request: https://github.com/llvm/llvm-project/pull/198140
DeltaFile
+3-3llvm/lib/Transforms/Instrumentation/IndirectCallPromotion.cpp
+3-31 files

LLVM/project 46c1fa8llvm/lib/Transforms/Vectorize LoopVectorize.cpp

[VPlan] Use ResumeForEpilogue to get epilogue vector trip count (NFC). (#198210)

Use ResumeForEpilogue to look up the vector trip count instead of plain
IR lookup. Also prepares for non-phi resume values.
DeltaFile
+26-23llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+26-231 files

LLVM/project 856f7d4llvm/lib/Transforms/Vectorize LoopVectorize.cpp

[LV] Extract helper to simplify phi removal in connectEpiVectorL (NFC) (#198203)

Extract the repeated edge-redirect + DomTree update pattern into a
RedirectEdge lambda, and convert the separate removeIncomingValue calls
for check blocks into a loop.
DeltaFile
+21-31llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+21-311 files

LLVM/project 1745344clang/test/CodeGen/AArch64 neon-perm.c, clang/test/CodeGen/AArch64/neon perm.c

rebase

Created using spr 1.3.7
DeltaFile
+1,250-1,357llvm/test/CodeGen/X86/avx512-calling-conv.ll
+203-915llvm/test/CodeGen/X86/vector-compress.ll
+158-868llvm/test/CodeGen/X86/avx512-ext.ll
+154-866llvm/test/CodeGen/X86/avx512-mask-op.ll
+492-120clang/test/CodeGen/AArch64/neon/perm.c
+0-383clang/test/CodeGen/AArch64/neon-perm.c
+2,257-4,509144 files not shown
+4,879-5,730150 files

LLVM/project e78a23cclang/test/CodeGen/AArch64 neon-perm.c, clang/test/CodeGen/AArch64/neon perm.c

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
DeltaFile
+1,250-1,357llvm/test/CodeGen/X86/avx512-calling-conv.ll
+203-915llvm/test/CodeGen/X86/vector-compress.ll
+158-868llvm/test/CodeGen/X86/avx512-ext.ll
+154-866llvm/test/CodeGen/X86/avx512-mask-op.ll
+492-120clang/test/CodeGen/AArch64/neon/perm.c
+0-383clang/test/CodeGen/AArch64/neon-perm.c
+2,257-4,509144 files not shown
+4,879-5,730150 files

LLVM/project df90525llvm/lib/Transforms/Scalar JumpTableToSwitch.cpp

[JTS] Readd assertion

Now that VP metadata has been cleaned up a little bit, we can reenable
this assertion.

Reviewers: alexander-shaposhnikov, mtrofin

Pull Request: https://github.com/llvm/llvm-project/pull/198141
DeltaFile
+3-4llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp
+3-41 files

LLVM/project 94584e7llvm/lib/Transforms/Scalar JumpTableToSwitch.cpp

rebase

Created using spr 1.3.7
DeltaFile
+3-4llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp
+3-41 files

LLVM/project 56ef779llvm/lib/Transforms/Scalar JumpTableToSwitch.cpp

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
DeltaFile
+3-4llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp
+3-41 files

LLVM/project c497efbllvm/lib/Transforms/InstCombine InstCombineSelect.cpp, llvm/test/Transforms/InstCombine logical-select.ll

[InstCombine] Convert logical and/or with trunc nuw to i1 into bitwise ops (#198178)

if it is know that `trunc nuw to i1 ` can not be poison logical and/or
can be folded to bitwise ops.

proof https://alive2.llvm.org/ce/z/xQ2Sj-
DeltaFile
+55-0llvm/test/Transforms/InstCombine/logical-select.ll
+13-5llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
+68-52 files

LLVM/project f8803b0libcxx/include/__cxx03/__memory uninitialized_algorithms.h, libcxx/include/__memory uninitialized_algorithms.h

[libc++] Require the exact assignment expression to be trivial in __uninitialized_allocator_copy_impl

__uninitialized_allocator_copy_impl has an optimization that replaces allocator_traits::construct with std::copy for raw pointer ranges when the element type is trivially copy constructible and trivially copy assignable.

The copy-assignment trait only checks whether assignment from const T& is trivial. That is weaker than the expression used by std::copy, which evaluates *out = *in. If overload resolution selects a different non-trivial assignment operator for that expression, std::copy can call that operator on uninitialized storage.

Check is_trivially_assignable<_Out&, _In&> instead in both header copies. This matches the assignment expression used by std::copy, preserves the optimized path when that assignment is actually trivial, and avoids making non-const raw pointer callers select the generic allocator_traits::construct overload due to a qualification conversion.

Add a vector copy-constructor regression test with a type whose defaulted copy assignment is trivial but whose templated assignment operator is selected for non-const lvalue sources.

Tested with:
build-libcxx-fresh/bin/llvm-lit -q libcxx/test/std/containers/sequences/vector/vector.cons/copy.pass.cpp libcxx/test/libcxx/memory/uninitialized_allocator_copy.pass.cpp
build-libcxx-fresh/bin/llvm-lit -q --param std=c++03 libcxx/test/libcxx/memory/uninitialized_allocator_copy.pass.cpp
build-libcxx-fresh/bin/llvm-lit -q --param std=c++20 libcxx/test/std/containers/sequences/vector/vector.cons/copy.pass.cpp
build-libcxx-fresh/bin/llvm-lit -q --param std=c++11 libcxx/test/std/containers/sequences/vector/vector.cons/copy.pass.cpp
DeltaFile
+76-1libcxx/test/std/containers/sequences/vector/vector.cons/copy.pass.cpp
+1-1libcxx/include/__cxx03/__memory/uninitialized_algorithms.h
+1-1libcxx/include/__memory/uninitialized_algorithms.h
+78-33 files

LLVM/project 0e92b55compiler-rt/lib/sanitizer_common sanitizer_platform_limits_solaris.cpp

[sanitizer_common] Fix sanitizer_platform_limits_solaris.cpp compilation (#198158)

When switching `clang++` to the default Solaris 11.4 compilation
environment, XPG7 + extensions, `sanitizer_platform_limits_solaris.cpp`
fails to compile:

```
compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_solaris.cpp:93:53: error: use of undeclared identifier 'ucontext_t'; did you mean 'ucontext_t_sz'?
   93 |   unsigned ucontext_t_sz(void *ctx) { return sizeof(ucontext_t); }
      |                                                     ^~~~~~~~~~
      |                                                     ucontext_t_sz
compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_solaris.cpp:93:12: note: 'ucontext_t_sz' declared here
   93 |   unsigned ucontext_t_sz(void *ctx) { return sizeof(ucontext_t); }
      |            ^
compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_solaris.cpp:93:52: error: invalid application of 'sizeof' to a function type
   93 |   unsigned ucontext_t_sz(void *ctx) { return sizeof(ucontext_t); }
      |                                                    ^~~~~~~~~~~~

```

    [4 lines not shown]
DeltaFile
+1-0compiler-rt/lib/sanitizer_common/sanitizer_platform_limits_solaris.cpp
+1-01 files

LLVM/project 598acfbcompiler-rt/test/asan/TestCases/Posix coverage-module-unloaded.cpp, compiler-rt/test/sanitizer_common/TestCases get_module_and_offset_for_pc.cpp

[sanitizer][test] Fix coverage-module-unloaded.cpp etc. on Solaris (#198164)

When switching `clang++` to the default Solaris 11.4 compilation
environment, XPG7 + extensions, two tests `FAIL`:

```
  AddressSanitizer-i386-sunos :: TestCases/Posix/coverage-module-unloaded.cpp
  AddressSanitizer-i386-sunos-dynamic :: TestCases/Posix/coverage-module-unloaded.cpp

  SanitizerCommon-asan-i386-SunOS :: get_module_and_offset_for_pc.cpp
  SanitizerCommon-ubsan-i386-SunOS :: get_module_and_offset_for_pc.cpp
  SanitizerCommon-ubsan-x86_64-SunOS :: get_module_and_offset_for_pc.cpp
```

The failure mode is the same in both cases: the tests fail to link with
`main` undefined. This happens because `<sys/mman.h>` defines

```
#define SHARED          0x10

    [13 lines not shown]
DeltaFile
+3-3compiler-rt/test/asan/TestCases/Posix/coverage-module-unloaded.cpp
+2-2compiler-rt/test/sanitizer_common/TestCases/get_module_and_offset_for_pc.cpp
+5-52 files

LLVM/project 3a2876dmlir/lib/Conversion/MathToSPIRV MathToSPIRV.cpp, mlir/test/Conversion/MathToSPIRV math-to-gl-spirv.mlir

[mlir][SPIR-V] Fix math.powf lowering for non-integer exponents (#197727)

The ConvertFToS usage only works when y is an integer. Use it only for
integer constants, for others: lower as GL.Exp(y * GL.Log(x))
DeltaFile
+72-64mlir/lib/Conversion/MathToSPIRV/MathToSPIRV.cpp
+79-26mlir/test/Conversion/MathToSPIRV/math-to-gl-spirv.mlir
+151-902 files

LLVM/project 47e142bllvm/docs ReleaseNotes.md, llvm/lib/Target/AArch64 AArch64ISelLowering.cpp

[AArch64] Fix handling of x29/x30 in inline assembly clobbers (#167783)

The AArch64 backend was silently ignoring inline assembly clobbers when
numeric register names (x29, x30) were used instead of their
architectural aliases (fp, lr). I found this bug via inline assembly
in Zig, which not normalize the register names the way clang does.

There is an incoplete workaround for this in Rust, but that only
handles `x30/lr`, not `x29/fp`. I thought it would make
sense to fix this properly rather than adding a workaround to Zig.

This patch adds explicit handling in getRegForInlineAsmConstraint() to
map both numeric and alias forms to the correct physical registers,
following the same pattern used by the RISC-V backend.

I've left `x31/sp` without changes, it would nice to have to have
warning when trying to clobber `x31`, just like there is for `sp`,
but that register needs different handling, so it's best done
separately.

    [24 lines not shown]
DeltaFile
+44-0llvm/test/CodeGen/AArch64/inline-asm-clobber-x29-x30.ll
+15-0llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
+5-0llvm/docs/ReleaseNotes.md
+64-03 files

LLVM/project d19af7fllvm/utils/lit/lit TestingConfig.py

add comment

Created using spr 1.3.7
DeltaFile
+1-0llvm/utils/lit/lit/TestingConfig.py
+1-01 files

LLVM/project 3597ec5llvm/utils/lit/lit TestingConfig.py

[𝘀𝗽𝗿] changes introduced through rebase

Created using spr 1.3.7

[skip ci]
DeltaFile
+1-0llvm/utils/lit/lit/TestingConfig.py
+1-01 files