[Driver] Enable -ftime-trace for CUDA/HIP device compilation (#179701)
Previously, -ftime-trace only generated trace files for host compilation
when compiling CUDA/HIP code. Device compilation was excluded because
the OffloadingPrefix was non-empty, causing handleTimeTrace() to be
skipped.
This patch enables -ftime-trace for offload device compilation by:
1. Passing the offloading prefix to handleTimeTrace()
2. Including the bound architecture in the trace filename
3. Deriving the trace output directory from the -o option for device
compilation (since the device output is a temp file)
Trace files are now generated for each offload target:
- Host: output.json
- Device: output-hip-amdgcn-amd-amdhsa-gfx906.json
Note: When using --save-temps, multiple compilation phases (preprocess,
compile, codegen) write to the same trace file, with each phase
[3 lines not shown]
[CUDA/HIP][SYCL] Deduplicate deferred diagnostics across multiple callers (#185926)
[CUDA/HIP][SYCL] Deduplicate deferred diagnostics across multiple
callers
Deferred diagnostics for a function were emitted once per caller that
forced the function into device context. When multiple device functions
called the same host-device function containing errors, the diagnostics
were repeated for each caller, producing noisy duplicate output.
Change the deferred diagnostic emission to a two-pass approach:
1. During the call graph walk, collect callers in DeviceKnownEmittedFns
(now storing multiple callers per function) and mark functions that
need diagnostics, but don't emit yet.
2. After the walk completes, emit diagnostics once per function with
all callers listed as notes.
Call chain notes now use "called by" for the first caller in each chain
and "which is called by" for subsequent callers in the chain, making it
[5 lines not shown]
[lldb] Nominate myself for FreeBSD maintainer (#180053)
I've been FreeBSD src contributor since 2022 and started working for the
FreeBSD Foundation starting from January. I created and have been
actively working on #180061 to achieve [LLDB improvement on
FreeBSD](https://wiki.freebsd.org/SummerOfCodeIdeas#Improve_LLDB_on_FreeBSD)
which was originally Google Summer of Code idea.
The initial LLDB support for userspace and kernel debugging on FreeBSD
was initiated by Moritz system back in 2020, but there are some missing
bits like architecture support due to lack of time. Due to this, FreeBSD
developers still depend on GDB-derived kernel debugger and it still
remains de-facto tier 1 debugger (e.g. no scripts in source tree for
lldb).
To track what/how I'm working on this task, please take a look at my
foundation [status report on LLDB
improvement](https://github.com/FreeBSDFoundation/status-updates/blob/main/Minsoo_Choo/kdb.md).
As you see, I will also improve Lua scripting on LLDB as Lua remains
[13 lines not shown]
[flang][cuda][openacc] Fix interoperability when host_data is not generated (#186516)
Update lowering to not process operands if the operation will not be
generated because of a `if(.false.)` clause. This was breaking when
enabling CUDA Fortran on the same test.
[NFC][sanitizer] Accept ETIMEDOUT in getpwnam_r_invalid_user.cpp (#186538)
On some systems, looking up an result in a timeout.
Error here is not a sign of compiler-rt issue.
Fixes flakiness on
https://lab.llvm.org/buildbot/#/builders/sanitizer-ppc64le-linux
[RISCV] Add codegen patterns to support short forward branches with immediates (#185643)
This is a follow-up to #182456. This PR adds support for short forward
branches where branches are from Qualcomm uC `Xqcibi` extension.
[libclc][NFC] Rename three .inc files to avoid name conflicts (#186384)
Follow-up of 9b96ebc. There are binary_def.inc and unary_def.inc in
header directory.
- clc_ep.inc -> clc_ep_decl.inc
- relational/binary_def.inc -> relational/relational_binary_def.inc
- relational/unary_def.inc -> relational/relational_unary_def.inc
[NFC][Support] Don't test UB in Caching.WriteAfterCommit (#186532)
The test expects crash after commit essentially null-dereferencing.
Just check that it's nullptr directly.
Fixes asan/ubsan buildbot.
[clang][DirectX] Specify element-aligned vectors in TargetInfo (#185954)
Add a bit to TargetInfo to specify that vectors are element-aligned
rather than naturally aligned. This is needed to match DirectX's Data
Layout in LLVM.
Note that this removes the `Opts.HLSL` early exit from
`checkDataLayoutConsistency` so that we actually get these checks when
compiling HLSL. This check looks like it was put there because of
similarity between OpenCL and HLSL, but it isn't actually necessary.
Resolves #123968
[libc] Reference the proper namespaced variables in the GPU header
Summary:
These linked to the extern "C" versions which did not exist in test
builds.
[ScalarEvolution] Limit recursion in getRangeRef for PHI nodes. (#152823)
Restrict PHI nodes that getRangeRef is allowed to recursively examine so
we don't need a "visited" set. And fix createSCEVIter so it creates all
the relevant SCEV nodes before getRangeRef tries to examine them.
The tests that are affected have induction variables that aren't
AddRecs. (Other cases are theoretically affected, but don't seem to show
up in our tests.)
Fix callee type generation (#186272)
The callee_type metadata is expected to be a list of generalized type
metadata by the IR verifier. But for indirect calls with internal
linkage the type metadata is just an integer. Avoid including them in
callee_type metadata.
This will reduce the precision of the generated call graph as the edges to internal linkage functions whose address were taken will not be present anymore. We need to handle this in the future.
[lldb] Enable SanitizersAllocationTraces=tagged in darwin-mte-launcher (#186326)
Collect allocation traces for tagged memory when using the
`darwin-mte-launcher` to help debug MTE crashes.
[lldb] Add support for the darwin-mte-launcher to lldb-dotest (#186319)
Add support for the `darwin-mte-launcher` to `lldb-dotest` when LLDB is
configured to run the tests under MTE.
Remove unicode character from AttrDocs.td (#186521)
PR #185225 introduced a single unicode character, which is the only
unicode character in this file. Change this to a ASCII/Latin1 letter.
[LoopIdiomVectorize] Preserve address space in FindFirstByte (#185226)
Fixes #185188
Use SearchStart->getType() instead of Builder.getPtrTy() so that
pointer-typed PHI nodes preserve the address space of the original
pointers.
Assisted-by: Claude (Anthropic)
[flang] Fix SELECT TYPE in OpenACC construct (#186511)
A routine in Semantics/resolve-directives.cpp was overwriting a symbol
table pointer in a parse tree Name, thereby removing the AssocEntity
with the correct type for a TYPE IS or CLASS IS clause that had been
placed there. I don't really understand why resolve-directives has to
overwrite symbol table pointers in the first place, but it definitely
shouldn't be replacing these.