[LoopUnroll] Fix freqs for unconditional latches: introduce tests (#191008)
This patch introduces all tests for PR #179520 but with current results
so that it is easier to see which results PR #179520 improves. This
patch should not land without PR #179520.
[AMDGPU] Use WriteSALUDummy for v_div_scale* (#191670)
This uses the new HWWriteRes for v_div_scale*
For an explanation of why we want to do this , see
https://github.com/llvm/llvm-project/pull/190095 . In short, the
scheduler will not try to cover the full latency of the instructions
without this new modelling.
For a clear example of this, see the changes to
llvm/test/CodeGen/AMDGPU/schedmodel-dummywrite.mir in the git log of
this PR.
[LoopUnroll] Record conditional latch info (#182403)
This patch makes no functional change and so introduces no new tests or
documentation, but it is not merely refactoring.
This patch gathers conditional latch info needed for PR #179520, which
fixes block frequencies when LoopUnroll converts a conditional latch in
an unrolled loop iteration to unconditional. Without PR #179520, this
patch is useless and should not land.
sysutils/ansible-sshjail: Fix at runtime with latest ansible
Import patch from upstream Pull request to allow it to work properly
with new ansible release.
Obtained from: https://github.com/austinhyde/ansible-sshjail/pull/46
[flang] Recognize non-addressable resources in FIR AA. (#191577)
Same as in #187423 change for CSE, we can assume that an effect
on a non-addressable resource cannot affect memory pointed to
by 'location'.
[SamplePGO] Optimize the basename matching logic for matching unused profiles (#191523)
This change optimizes the basename matching logic in
`SampleProfileMatcher::matchFunctionsWithoutProfileByBasename` by
replacing the existing O(N*M) nested loop with an O(N+M) hash-based
lookup, while strictly preserving the original matching semantics. The
previous implementation relied on a substring heuristic
(`ProfName.contains(BaseName)`) to bypass expensive demangling
operations during the nested iteration; however, in codebases with
common or overlapping function names, this heuristic frequently
evaluated to true, resulting in redundant demangling and quadratic time
complexity. The updated approach demangles each profile name exactly
once and utilizes a `StringMap` to perform O(1) lookups against the
orphan functions. This eliminates the need for the substring pre-check
while maintaining the exact same constraints: establishing a strict 1:1
mapping between orphaned IR functions and profile entries, and correctly
identifying and rejecting ambiguous matches where multiple entities
share the same demangled basename.
Results in a 9x speedup on a large module with common basenames.
textproc/rumdl: update to 0.1.71
v0.1.71
Added
config: support bare rule name in rumdl config get (8369fb1)
Fixed
config: move test module to end of flavor.rs to satisfy clippy items_after_test_module (e30714d)
config: load user config alongside discovered markdownlint project config (c0f79c1)
config: display global.flavor as lowercase quoted string in rumdl config output (b162295)
v0.1.70
Fixed
MD044: do not flag proper names inside bare-domain link text (56a45df)
v0.1.69
[9 lines not shown]
[libc][math] Refactor copysign family to header-only (#182137)
Refactors the copysign math family to be header-only.
Closes https://github.com/llvm/llvm-project/issues/182136
Target Functions:
- copysign
- copysignbf16
- copysignf
- copysignf128
- copysignf16
- copysignl
---------
Co-authored-by: bassiounix <muhammad.m.bassiouni at gmail.com>
Document RETURN value for timegm(3)
APIs with in-band errors that conflate the error with a legitimate return
value are about the worst you can get. Near and dear to my heart is the API
aptly described as "gibbering eidritch horror" by beck: ASN1_INTEGER_get(3).
Adapt the wording of its RETURN VALUES to timegm() and mktime(), for which
Dec 31, 1969 at 23:59:59 will yield the error return -1 and thereby errata.
Missing docs pointed out by claudio a while back and yesterday by deraadt
ok deraadt millert