
Support for candidate generation and tuning attention#2743

Open
keshavvinayak01 wants to merge 13 commits into main from users/keshavvinayak01/benchmarking-boo-sdpa

Conversation

@keshavvinayak01

No description provided.

Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
@github-actions
Contributor

github-actions bot commented Dec 22, 2025

Coverage report

This PR does not seem to contain any modification to coverable code.

@keshavvinayak01 changed the title from "Draft work." to "[WIP] Support for candidate generation and tuning attention" on Jan 16, 2026
@keshavvinayak01 (Author) left a comment

@bangtianliu It's a WIP, but could you go through the changes and see if they make sense?

All these changes were required to make the boo_tuner work with attention and to generate more and better candidates for benchmarking.

@Groverkss
Contributor

Nice! I like the changes. Will redirect to @bangtianliu for review.

Comment on lines 899 to 907
```python
# For attention ops, use VectorDistribute pipeline instead of TileAndFuse
if dispatch_tuner.get_dispatch_kind() == common.DispatchKind.attention:
    if args.codegen_pipeline != CodegenPipelines.llvmgpu_vector_distribute:
        logging.info(
            f"Attention operation detected. Overriding codegen pipeline "
            f"from {args.codegen_pipeline} to llvmgpu_vector_distribute"
        )
        args.codegen_pipeline = CodegenPipelines.llvmgpu_vector_distribute
```

@bangtianliu (Contributor) commented Jan 16, 2026
This code is unnecessary, since we already have code to handle this case:

```python
if (
    dispatch_kind != common.DispatchKind.attention
    or codegen_pipeline
    != iree_codegen.DispatchLoweringPassPipeline.LLVMGPUVectorDistribute
):
    return []
```

@keshavvinayak01 (Author) replied:

The referenced code acts as a guard that returns no solutions. We should either change that code to overwrite the pipeline to vector-distribute for the attention case, or keep my code.
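The alternative proposed above could be sketched roughly as follows. This is a minimal, hypothetical illustration, not the tuner's actual API: `DispatchKind`, `Pipeline`, and `select_pipeline` are stand-in names, and the real code works with `common.DispatchKind` and `iree_codegen.DispatchLoweringPassPipeline`.

```python
from enum import Enum, auto


# Stand-ins for the tuner's dispatch and pipeline enums (illustrative only).
class DispatchKind(Enum):
    contraction = auto()
    attention = auto()


class Pipeline(Enum):
    llvmgpu_tile_and_fuse = auto()
    llvmgpu_vector_distribute = auto()


def select_pipeline(dispatch_kind: DispatchKind, requested: Pipeline) -> Pipeline:
    """Return the pipeline to use for candidate generation.

    Instead of acting as a guard that returns an empty solution list,
    this overrides the requested pipeline to VectorDistribute whenever
    the dispatch is an attention op.
    """
    if dispatch_kind == DispatchKind.attention:
        return Pipeline.llvmgpu_vector_distribute
    return requested
```

With this shape, non-attention dispatches keep whatever pipeline was requested, and attention dispatches are silently normalized rather than rejected.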

@bangtianliu
Contributor

Left some comments here. I can do another pass once this PR is ready for review.

@bangtianliu requested a review from kuhar on January 16, 2026 16:50
@keshavvinayak01 changed the title from "[WIP] Support for candidate generation and tuning attention" to "[WIP] [Do not Review] Support for candidate generation and tuning attention" on Feb 6, 2026
Extract duplicated bf16/f16 with f32 accumulator compatibility logic from
common.py and dispatch_constraints.py into a shared helper function. Use
isinstance() for type comparison instead of str(). Revert benchmark log
level from info back to debug. Add test for the new helper.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
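The shared helper described in the commit message above might look roughly like the following. This is a hypothetical sketch: the class names `F16Type`, `BF16Type`, and `F32Type` are plain-Python stand-ins for the MLIR element types the tuner actually gets from IREE's Python bindings, and `is_compatible_accumulator` is an illustrative name, not the real helper.

```python
# Illustrative stand-ins for MLIR element types (the real code would use
# IREE's type bindings, which are not modeled here).
class F16Type: pass
class BF16Type: pass
class F32Type: pass


def is_compatible_accumulator(input_type, acc_type) -> bool:
    """Check input/accumulator element-type compatibility.

    bf16/f16 inputs are allowed to accumulate in f32; otherwise the
    accumulator must match the input type. Uses isinstance() for the
    comparison rather than comparing str() representations.
    """
    if isinstance(input_type, (F16Type, BF16Type)):
        return isinstance(acc_type, (F32Type, type(input_type)))
    return isinstance(acc_type, type(input_type))
```

Factoring this into one helper lets common.py and dispatch_constraints.py share a single source of truth for the accumulator rule.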
@keshavvinayak01 changed the title from "[WIP] [Do not Review] Support for candidate generation and tuning attention" to "[WIP] Support for candidate generation and tuning attention" on Feb 18, 2026
keshavvinayak01 and others added 3 commits February 18, 2026 10:41
Fix denorm_flushing type to list[bool] with default [False] and use local
variable for attention pipeline selection instead of mutating args.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
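The two fixes in the commit message above could be sketched as follows. This is a hedged illustration under stated assumptions: `resolve_tuning_config` and the `"attention"` string key are hypothetical names, standing in for however the tuner actually threads these values through.

```python
import argparse


def resolve_tuning_config(args: argparse.Namespace, dispatch_kind: str) -> dict:
    """Sketch of the two commit-message fixes (names are illustrative).

    1. denorm_flushing is a list of candidate values, defaulting to [False].
    2. The attention pipeline override lives in a local variable; the
       parsed args namespace is never mutated.
    """
    denorm_flushing: list[bool] = args.denorm_flushing or [False]
    codegen_pipeline = args.codegen_pipeline
    if dispatch_kind == "attention":
        codegen_pipeline = "llvmgpu_vector_distribute"
    return {
        "denorm_flushing": denorm_flushing,
        "codegen_pipeline": codegen_pipeline,
    }
```

Keeping the override local means a later dispatch in the same run still sees the user's original `--codegen-pipeline` choice.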
@keshavvinayak01 changed the title from "[WIP] Support for candidate generation and tuning attention" to "Support for candidate generation and tuning attention" on Feb 18, 2026
keshavvinayak01 and others added 3 commits February 18, 2026 11:14
…xed accumulator type matching in MMA compatibility check

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
@keshavvinayak01 marked this pull request as ready for review February 18, 2026 12:24

3 participants