LLVM/project ce435ddllvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp, llvm/test/CodeGen/AMDGPU coexec-sched-effective-stall.mir

Adrress comments from https://github.com/llvm/llvm-project/pull/188658

Change-Id: Ia94c567a753168c1ffa16dc5d91195e7dd0ba044
DeltaFile
+114-114llvm/test/CodeGen/AMDGPU/coexec-sched-effective-stall.mir
+3-3llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+117-1172 files

LLVM/project 2537596clang/include/clang/Basic DiagnosticLexKinds.td Module.h, clang/include/clang/Lex ModuleMap.h

[clang][modules] Diagnose headers owned by multiple modules (#188538)

Add -Wduplicate-header-ownership, an off by default warning that fires
at include time when a header is owned by multiple top-level modules.
This helps catch overlapping module maps that can cause confusing module
resolution.

Assisted-by: claude-opus-4.6
DeltaFile
+159-0clang/test/Modules/duplicate-header-ownership.c
+105-12clang/lib/Lex/ModuleMap.cpp
+23-4clang/include/clang/Lex/ModuleMap.h
+7-0clang/include/clang/Basic/DiagnosticLexKinds.td
+3-0clang/include/clang/Basic/Module.h
+297-165 files

LLVM/project 8d5a719clang/include/clang/ScalableStaticAnalysisFramework/Analyses/PointerFlow PointerFlow.h, clang/lib/ScalableStaticAnalysisFramework/Analyses SSAFAnalysesCommon.cpp

clean up code
DeltaFile
+45-66clang/lib/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowExtractor.cpp
+14-13clang/lib/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlow.cpp
+1-1clang/include/clang/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlow.h
+1-1clang/lib/ScalableStaticAnalysisFramework/Analyses/SSAFAnalysesCommon.cpp
+1-1clang/test/Analysis/Scalable/PointerFlow/tu-summary-serialization.test
+62-825 files

LLVM/project 561cf0cllvm/include/llvm Pass.h, llvm/include/llvm/IR PassTimingInfo.h

[NFC] Move TimePasses globals from Pass.h to PassTimingInfo.h (#192352)

They don't belong in the legacy pass manager-specific header, they apply
to both pass managers, plus the pass manager isn't the right layer to
put the bools anyway.
DeltaFile
+0-11llvm/include/llvm/Pass.h
+11-0llvm/include/llvm/IR/PassTimingInfo.h
+1-4llvm/lib/IRReader/IRReader.cpp
+1-0llvm/unittests/IR/TimePassesTest.cpp
+1-0llvm/lib/Target/AMDGPU/AMDGPUSplitModule.cpp
+1-0llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
+15-154 files not shown
+19-1510 files

LLVM/project 975fda5llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp

Add comment

Change-Id: I2180bba631fe4a01ed3c3fbcfa8c19cbefa84133
DeltaFile
+1-0llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+1-01 files

LLVM/project 5be815allvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp

clang-format

Change-Id: I534b1a979f55339a814ef3416c2492252845add5
DeltaFile
+6-3llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+6-31 files

LLVM/project f892036llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.h

Add a comment

Change-Id: I447f7f1fb185b18924cfd98249b5a0a05fef2484
DeltaFile
+7-0llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.h
+7-01 files

LLVM/project 996914cllvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp

Add back tryLatency

Change-Id: I12d4f255c48ed77ba927eb3b192e5903f1f5e24f
DeltaFile
+7-1llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+7-11 files

LLVM/project 17f284cllvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp

Make fence heuristic work bottom-up

Change-Id: I629cbc8905b87a962e8b123287e5f60a3154df6b
DeltaFile
+19-17llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+19-171 files

LLVM/project 9cc5d20llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp AMDGPUCoExecSchedStrategy.h, llvm/test/CodeGen/AMDGPU coexec-sched-effective-stall.mir

[AMDGPU] Add MemoryPipeline scheduling to Coexec sched

Change-Id: I52c476834155823d1ba998cdbbcb3ad6a7e6f2f5
DeltaFile
+323-0llvm/test/CodeGen/AMDGPU/coexec-sched-effective-stall.mir
+77-23llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+18-0llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.h
+418-233 files

FreeBSD/src a85c4abshare/man/man4 appleir.4 Makefile, sys/conf files

appleir: Add Apple IR receiver driver

HID driver for Apple IR receivers (USB HID, vendor 0x05ac).
Supports Apple Remote and generic IR remotes using NEC protocol.

Supported hardware:
- Apple IR Receiver (0x8240, 0x8241, 0x8242, 0x8243, 0x1440)

Apple Remote protocol (proprietary 5-byte HID reports):
- Key down/repeat/battery-low detection
- 17-key mapping with two-packet command support
- Synthesized key-up via 125ms callout timer

Generic IR remotes (NEC protocol):
- Format: [0x26][0x7f][0x80][code][~code]
- Checksum: code + ~code = 0xFF
- Default keymap with 8 common codes
- See: https://techdocs.altium.com/display/FPGA/NEC+Infrared+Transmission+Protocol


    [9 lines not shown]
DeltaFile
+440-0sys/dev/hid/appleir.c
+93-0share/man/man4/appleir.4
+8-0sys/modules/hid/appleir/Makefile
+1-0share/man/man4/Makefile
+1-0sys/conf/files
+1-0sys/modules/hid/Makefile
+544-06 files

LLVM/project 48408f9llvm/lib/Target/AMDGPU AMDGPUCoExecSchedStrategy.cpp

Remove unused function

Change-Id: I9f2de1497f793d2848dedaf645e21e07a4ba82d6
DeltaFile
+0-60llvm/lib/Target/AMDGPU/AMDGPUCoExecSchedStrategy.cpp
+0-601 files

OpenBSD/ports 0ulO9Csnet/czds distinfo Makefile

   Update czds to 1.4.0.
VersionDeltaFile
1.4+2-2net/czds/distinfo
1.4+1-1net/czds/Makefile
+3-32 files

LLVM/project a3af640flang/include/flang/Evaluate tools.h, flang/lib/Evaluate tools.cpp

[flang][cuda] Avoid false positive on multi device symbol with components (#192177)

Semantic was wrongly flagging derived-type components as two device
resident object. Update how we collect symbols and count the number of
device resident object.
DeltaFile
+34-0flang/lib/Evaluate/tools.cpp
+26-0flang/test/Lower/CUDA/cuda-data-transfer.cuf
+11-0flang/include/flang/Evaluate/tools.h
+2-1flang/lib/Semantics/check-cuda.cpp
+73-14 files

Linux/linux aec2f68arch/s390/crypto des_s390.c, arch/sparc/crypto des_glue.c

Merge tag 'v7.1-p1' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6

Pull crypto update from Herbert Xu:
 "API:
   - Replace crypto_get_default_rng with crypto_stdrng_get_bytes
   - Remove simd skcipher support
   - Allow algorithm types to be disabled when CRYPTO_SELFTESTS is off

  Algorithms:
   - Remove CPU-based des/3des acceleration
   - Add test vectors for authenc(hmac(md5),cbc({aes,des})) and
     authenc(hmac({md5,sha1,sha224,sha256,sha384,sha512}),rfc3686(ctr(aes)))
   - Replace spin lock with mutex in jitterentropy

  Drivers:
   - Add authenc algorithms to safexcel
   - Add support for zstd in qat
   - Add wireless mode support for QAT GEN6
   - Add anti-rollback support for QAT GEN6

    [24 lines not shown]
DeltaFile
+1,764-0crypto/testmgr.h
+868-31drivers/crypto/ti/dthev2-aes.c
+0-831arch/x86/crypto/des3_ede-asm_64.S
+515-25drivers/crypto/intel/qat/qat_common/qat_comp_algs.c
+0-502arch/s390/crypto/des_s390.c
+0-482arch/sparc/crypto/des_glue.c
+3,147-1,871160 files not shown
+5,209-3,842166 files

LLVM/project f834a48clang/lib/CIR/CodeGen CIRGenCall.cpp, clang/test/CIR/CodeGen trivial-abi.cpp

[CIR][ABI] Handle callee-destructed params for trivial_abi (#191257)

Replace errorNYI for isParamDestroyedInCallee with working
implementation: create aggregate temp, mark externally destructed,
emit expr.  Unblocks [[trivial_abi]] types on Itanium ABI.

Adds trivial-abi.cpp test covering 17 cases from
CodeGenCXX/trivial_abi.cpp with CIR/LLVM/OGCG checks.

Made with [Cursor](https://cursor.com)
DeltaFile
+316-0clang/test/CIR/CodeGen/trivial-abi.cpp
+20-5clang/lib/CIR/CodeGen/CIRGenCall.cpp
+336-52 files

LLVM/project 0a4d3b3clang/test/CIR/CodeGen attr-noundef.cpp, clang/test/CIR/CodeGenCXX uncopyable-args.cpp x86_64-arguments.cpp

[CIR][ABI][NFC] Add x86_64 ABI parity tests (#191259)

Add three test files for CIR ABI parity on x86_64, all with
CIR/LLVM/OGCG checks:

- uncopyable-args.cpp — 24 functions covering non-copyable and
  move-only types (trivial, default-ctor, move-ctor, etc.)
- x86_64-arguments.cpp — 26 functions covering C++ struct passing,
  inheritance, member pointers, empty bases, packed structs
- attr-noundef.cpp — 26 functions covering noundef placement on
  structs, unions, vectors, member pointers, _BitInt

Made with [Cursor](https://cursor.com)
DeltaFile
+464-0clang/test/CIR/CodeGenCXX/uncopyable-args.cpp
+252-0clang/test/CIR/CodeGenCXX/x86_64-arguments.cpp
+235-0clang/test/CIR/CodeGen/attr-noundef.cpp
+951-03 files

Linux/linux 8801e23Documentation/security ipe.rst

Merge tag 'ipe-pr-20260413' of git://git.kernel.org/pub/scm/linux/kernel/git/wufan/ipe

Pull IPE update from Fan Wu:
 "A single commit from Evan Ducas that fixes several spelling and
  grammar mistakes in the IPE documentation. There are no functional
  changes"

* tag 'ipe-pr-20260413' of git://git.kernel.org/pub/scm/linux/kernel/git/wufan/ipe:
  docs: security: ipe: fix typos and grammar
DeltaFile
+5-5Documentation/security/ipe.rst
+5-51 files

LLVM/project b2af653clang/include/clang/CIR MissingFeatures.h, clang/lib/CIR/CodeGen CIRGenStmt.cpp CIRGenClass.cpp

[CIR][NFC] Convert MissingFeatures::requiresCleanups to errorNYI (#192350)

This change adds errorNYI calls in two places that we previously had
requiresCleanups() missing features markers, adds a more specific
missing feature marker for loops, removes one requiresCleanups() where
the handling was already implemented, and deletes a bunch of missing
feature markers there were never used.
DeltaFile
+4-4clang/lib/CIR/CodeGen/CIRGenStmt.cpp
+4-3clang/lib/CIR/CodeGen/CIRGenClass.cpp
+1-6clang/include/clang/CIR/MissingFeatures.h
+0-2clang/lib/CIR/CodeGen/CIRGenExprAggregate.cpp
+9-154 files

LLVM/project 5b979f5bolt/lib/Passes Instrumentation.cpp

[BOLT][Passes] use ADT containers for instrumentation spanning tree. (#192289)

Swap `std::unordered_map<…, std::set<…>>` for
`DenseMap<…, SmallVector<…>>` in `Instrumentation::instrumentFunction`
and switch read paths from `STOutSet[&BB]` to `find()`. This removes
per-set heap allocations, stops inserting empty buckets on every probe,
and replaces linear `is_contained()` scans over a red-black tree with
linear scans over inline `SmallVector` storage (most basic blocks have
at most a couple of spanning-tree out-edges). NFC.
DeltaFile
+12-7bolt/lib/Passes/Instrumentation.cpp
+12-71 files

Linux/linux 8396455arch/arm/xen enlighten.c, drivers/tty/hvc hvc_xen.c

Merge tag 'for-linus-7.1-rc1-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/xen/tip

Pull xen updates from Juergen Gross:

 - fix an error path in drivers/xen/manage.c

 - fix the Xen console driver solving a boot hangup when the console
   backend isn't yet running

 - comment fix in the Xen swiotlb driver

 - hardening for Xen on Arm adding a more thorough validation

 - cleanup of the Xen grant table code hiding suspend/resume code for
   the case if CONFIG_HIBERNATE_CALLBACKS isn't defined

* tag 'for-linus-7.1-rc1-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/xen/tip:
  xen/grant-table: guard gnttab_suspend/resume with CONFIG_HIBERNATE_CALLBACKS
  hvc/xen: Check console connection flag

    [3 lines not shown]
DeltaFile
+17-3drivers/xen/manage.c
+13-0include/xen/interface/io/console.h
+12-0include/xen/grant_table.h
+6-4arch/arm/xen/enlighten.c
+3-0drivers/tty/hvc/hvc_xen.c
+2-1drivers/xen/grant-table.c
+53-81 files not shown
+54-97 files

OpenBSD/ports pqIkG9bconverters/bdf2psf Makefile distinfo

   Update bdf2psf to 1.247.
VersionDeltaFile
1.52+2-2converters/bdf2psf/Makefile
1.47+2-2converters/bdf2psf/distinfo
+4-42 files

Linux/linux a5f9980Documentation/admin-guide/device-mapper verity.rst, drivers/md dm-verity-fec.c dm-crypt.c

Merge tag 'for-7.1/dm-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm

Pull device mapper updates from Benjamin Marzinski:
 "There are fixes for some corner case crashes in dm-cache and
  dm-mirror, new setup functionality for dm-vdo, and miscellaneous minor
  fixes and cleanups, especially to dm-verity.

  dm-vdo:
   - Make dm-vdo able to format the device itself, like other dm
     targets, instead of needing a userspace formating program
   - Add some sanity checks and code cleanup

  dm-cache:
   - Fix crashes and hangs when operating in passthrough mode (which
     have been around, unnoticed, since 4.12), as well as a late
     arriving fix for an error path bug in the passthrough fix
   - Fix a corner case memory leak

  dm-verity:

    [37 lines not shown]
DeltaFile
+174-210drivers/md/dm-verity-fec.c
+234-101drivers/md/dm-vdo/vdo.c
+211-10drivers/md/dm-vdo/encodings.c
+138-13drivers/md/dm-vdo/dm-vdo-target.c
+49-91drivers/md/dm-crypt.c
+102-20Documentation/admin-guide/device-mapper/verity.rst
+908-44567 files not shown
+1,351-86373 files

Linux/linux f1d26d7drivers/iommu/arm/arm-smmu-v3 arm-smmu-v3.c arm-smmu-v3.h, drivers/iommu/generic_pt iommu_pt.h

Merge tag 'iommu-updates-v7.1' of git://git.kernel.org/pub/scm/linux/kernel/git/iommu/linux

Pull iommu updates from Joerg Roedel:
 "Core:

   - Support for RISC-V IO-page-table format in generic iommupt code

  ARM-SMMU Updates:

   - Introduction of an "invalidation array" for SMMUv3, which enables
     future scalability work and optimisations for devices with a large
     number of SMMUv3 instances

   - Update the conditions under which the SMMUv3 driver works around
     hardware errata for invalidation on MMU-700 implementations

   - Fix broken command filtering for the host view of NVIDIA's "cmdqv"
     SMMUv3 extension


    [45 lines not shown]
DeltaFile
+800-156drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
+123-263drivers/iommu/riscv/iommu.c
+313-0drivers/iommu/generic_pt/fmt/riscv.h
+105-84drivers/iommu/generic_pt/iommu_pt.h
+136-6drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h
+135-0drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c
+1,612-50936 files not shown
+2,157-79642 files

LLVM/project eab567allvm/lib/Target/AMDGPU SIInstructions.td, llvm/test/CodeGen/AMDGPU sub.v2i16.ll add.v2i16.ll

[AMDGPU] Add true16 patterns for build_vector (vgpr, 0) (#192147)

It is shorter than VOP3 `and` instruction and in some cases
can save a second move.
DeltaFile
+8-10llvm/test/CodeGen/AMDGPU/sub.v2i16.ll
+8-10llvm/test/CodeGen/AMDGPU/add.v2i16.ll
+7-5llvm/test/CodeGen/AMDGPU/fcanonicalize.f16.ll
+4-8llvm/test/CodeGen/AMDGPU/flat-saddr-load.ll
+10-0llvm/lib/Target/AMDGPU/SIInstructions.td
+2-8llvm/test/CodeGen/AMDGPU/divergence-driven-buildvector.ll
+39-414 files not shown
+48-5010 files

Linux/linux 5a69195Documentation/devicetree/bindings/ata baikal,bt1-ahci.yaml, drivers/ata libata-transport.c libata-scsi.c

Merge tag 'ata-7.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/libata/linux

Pull ata updates from Niklas Cassel:

 - Misc code cleanups related to tag checking and tag command completion
   (Damien)

 - Remove Baikal bt1-ahci DT binding since the upstreaming for this SoC
   is not going to be finalized (Andy)

 - Only call the libata port error handler from the SCSI error handler
   if there were command timeouts or if EH was scheduled for the port
   (Damien)

 - Refactor ata_scsiop_maint_in() to more clearly show that there is
   only one service action implemented for the MAINTENANCE IN command
   (me)

 - Clean up the handling of sysfs attributes exposed by libata (Heiner)

    [32 lines not shown]
DeltaFile
+92-146drivers/ata/libata-transport.c
+117-110drivers/ata/libata-scsi.c
+0-115Documentation/devicetree/bindings/ata/baikal,bt1-ahci.yaml
+0-55drivers/ata/ahci_dwc.c
+15-18drivers/ata/libahci_platform.c
+20-10drivers/ata/libata-eh.c
+244-4547 files not shown
+265-48413 files

LLVM/project d430d89clang/lib/CIR/Dialect/Transforms LoweringPrepare.cpp, clang/test/CIR/CodeGenCUDA device-stub.cu

[CIR][CUDA] Do Runtime Kernel Registration (#188926)

Related:
https://github.com/issues/assigned?issue=llvm%7Cllvm-project%7C179278,
https://github.com/llvm/llvm-project/issues/175871

More registration shenanigans -> Generates `__cuda_register_globals`
that associates the fatbin with kernels that contain `__global__`
qualifiers with the runtime.

Generated equivalent runtime code:

``` C
  // Called once per kernel to register it with the CUDA runtime.
  void __cuda_register_globals(void **fatbinHandle) {
      __cudaRegisterFunction(
          fatbinHandle,
          (const char *)&_Z25__device_stub__kernelfunciii, // host-side stub ptr
          (char *)__cuda_kernelname_str,                   // device-side mangled name

    [13 lines not shown]
DeltaFile
+119-2clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp
+28-2clang/test/CIR/CodeGenCUDA/device-stub.cu
+147-42 files

LLVM/project 5dc1fd4clang/lib/CIR/CodeGen CIRGenCall.cpp, clang/test/CIR/CodeGen amdgpu-call-addrspace-cast.cpp

[CIR] Add address space casts for pointer arguments when creating a call (#192303)

This patch checks if the expected type for an argument is the same as
the actual type. If types are pointers but with different address spaces
then add an address space cast to make the pointer types match.

Assised-by: Cursor / Claude Opus 4.6
DeltaFile
+47-0clang/test/CIR/CodeGen/amdgpu-call-addrspace-cast.cpp
+12-0clang/lib/CIR/CodeGen/CIRGenCall.cpp
+59-02 files

LLVM/project bbc6a54mlir/include/mlir/Dialect/XeGPU/IR XeGPUOps.td, mlir/lib/Dialect/XeGPU/IR XeGPUOps.cpp

[MLIR][XeGPU] Remove create tdesc & update offset op from xegpu dialect (#182804)

This PR removes create_tdesc and update_offset ops from the XeGPU
dialect, as scatter load/store/prefetch now accept memref+offsets
directly.
DeltaFile
+132-300mlir/test/Dialect/XeGPU/invalid.mlir
+1-287mlir/test/Dialect/XeGPU/ops.mlir
+14-219mlir/include/mlir/Dialect/XeGPU/IR/XeGPUOps.td
+1-202mlir/test/Dialect/XeGPU/xegpu-unroll-patterns.mlir
+0-168mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp
+1-150mlir/test/Dialect/XeGPU/xegpu-blocking.mlir
+149-1,32612 files not shown
+187-1,75818 files

OpenZFS/src 1644e2fmodule/zfs dbuf.c, tests/runfiles common.run

Fix read corruption after block clone after truncate

When copy_file_range overwrites a recent truncation, subsequent reads
can incorrectly determine that it is read hole instead of reading the
cloned blocks.

This can happen when the following conditions are met:
- Truncate adds blkid to dn_free_ranges
- A new TXG is created
- copy_file_range calls dmu_brt_clone which override the block pointer
  and set DB_NOFILL
- Subsequent read, given DB_NOFILL, hits dbuf_read_impl and
  dbuf_read_hole
- dbuf_read_hole calls dnode_block_freed, which returns TRUE because the
  truncated blkids are still in dn_free_ranges

This will not happen if the clone and truncate are in the same TXG,
because the block clone would update the current TXG's dn_free_ranges,
which is why this bug only triggers under high IO load (such as

    [16 lines not shown]
DeltaFile
+117-0tests/zfs-tests/cmd/clone_after_trunc.c
+31-0tests/zfs-tests/tests/functional/block_cloning/block_cloning_after_trunc.ksh
+5-1module/zfs/dbuf.c
+2-1tests/runfiles/common.run
+2-0tests/zfs-tests/cmd/Makefile.am
+1-0tests/zfs-tests/tests/Makefile.am
+158-23 files not shown
+161-29 files