Resolve merge/04d4be501dc83fe411193a46c10e898898552731 stable 21.x #11029

jkorous-apple · 2025-07-18T23:46:43Z

No description provided.

This analysis currently just crashes when applied to a graph region that has a use-def cycle. This PR fixes that by keeping track of the operations the DFS has already visited when following use-def edges and stopping once we visit an operation again.

Commit a629322 forced the register class of ZPR[24]StridedOrContiguous for spills/fills of ZPR2 and ZPR4, but this may result in issues when the regclass for the fill is a ZPR2/ZPR4 which would allow the register allocator to pick `z1_z2`, which is not a supported register for ZPR2StridedOrContiguous that only supports tuples of the form (strided) `z0_z8`, `z1_z9` or (contiguous, start at multiple of 2) `z0_z1`, `z2_z3`. For spills we could add a new register class that supports any of the tuple forms, but I've decided to use two pseudos similar to the fills for consistency. Fixes llvm#148655

llvm#148824) By finalizing the bundle _after_ copying over the implicit-ops, it also adds any implicit-defs to the BUNDLE. Fixes llvm#148645

This sets the cache line size to 64 for the Neoverse V2 and V3. I've tested this with loop-interchange: it doesn't result in extra compile-times, but it does enable a lot more interchange.

…ddrRegImm9. (llvm#148779) To fold a FrameIndex, we need to teach eliminateFrameIndex to respect the uimm9 range. (cherry picked from commit 63d099a)

The transformation done in llvm#147349 was incorrect since we were not passing the input node of the `OR` instruction to the `QC.INSBI` instruction leading to the generated instruction doing the wrong thing. In order to do this we first needed to add the output register to `QC.INSBI` as being both an input and output. The code produced after the above fix will need a copy (mv) to preserve the register input to the OR instruction if it has more than one use making the transformation net neutral ( `6-byte QC.E.ORI/ORAI` vs `2-byte C.MV + 4-byte QC.INSB`I). Avoid doing the transformation if there is more than one use of the input register to the OR instruction. (cherry picked from commit d67d91a)

Happened to spot this while looking at libclang.map for other reasons. clang_visitCXXMethods was added in LLVM 21, not LLVM 20. (cherry picked from commit 116110e)

tru and others added 18 commits July 15, 2025 15:59

Bump version to 21.1.0-git

6296ebd

Merge commit '6296ebd45d3f' from llvm.org/release/21.x into stable/21.x

2d58182

Merge commit '18624ae54bc9' from llvm.org/release/21.x into stable/21.x

85a88f8

Merge commit '588b8130794f' from llvm.org/release/21.x into stable/21.x

62e1484

[AArch64] Ensure bundle expansion of MOVPRFX gets correct implicit ops (

d1517ec

llvm#148824) By finalizing the bundle _after_ copying over the implicit-ops, it also adds any implicit-defs to the BUNDLE. Fixes llvm#148645

Merge commit 'd1517ec62222' from llvm.org/release/21.x into stable/21.x

a5695c9

[AArch64] Set the cache line size to 64 for the V2 and V3. (llvm#148213)

7d803c8

This sets the cache line size to 64 for the Neoverse V2 and V3. I've tested this with loop-interchange: it doesn't result in extra compile-times, but it does enable a lot more interchange.

Merge commit '7d803c868ab9' from llvm.org/release/21.x into stable/21.x

b4f0637

[Frontend][OpenMP] Move isPrivatizingClause to OMP.h, NFC (llvm#148644)

a0895b4

Merge commit 'a0895b4581ba' from llvm.org/release/21.x into stable/21.x

c9e8865

[RISCV] Remove incorrect and untested FrameIndex support from SelectA…

49722f1

…ddrRegImm9. (llvm#148779) To fold a FrameIndex, we need to teach eliminateFrameIndex to respect the uimm9 range. (cherry picked from commit 63d099a)

Merge commit '49722f1df1ef' from llvm.org/release/21.x into stable/21.x

7ecf20b

Merge commit 'b71c9a436641' from llvm.org/release/21.x into stable/21.x

e797e3c

[libclang] Fix version for symbol clang_visitCXXMethods (llvm#148958)

04d4be5

Happened to spot this while looking at libclang.map for other reasons. clang_visitCXXMethods was added in LLVM 21, not LLVM 20. (cherry picked from commit 116110e)

Merge commit '3cb0c7f45b97' from llvm.org/main into next

804240d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Resolve merge/04d4be501dc83fe411193a46c10e898898552731 stable 21.x #11029

Resolve merge/04d4be501dc83fe411193a46c10e898898552731 stable 21.x #11029

Uh oh!

jkorous-apple commented Jul 18, 2025

Uh oh!

Uh oh!

Resolve merge/04d4be501dc83fe411193a46c10e898898552731 stable 21.x #11029

Are you sure you want to change the base?

Resolve merge/04d4be501dc83fe411193a46c10e898898552731 stable 21.x #11029

Uh oh!

Conversation

jkorous-apple commented Jul 18, 2025

Uh oh!

Uh oh!