[MLIR][TORCH] Add support for `enable_gqa` flag in SDPA op #3950

vivekkhandelwal1 · 2025-01-09T09:19:37Z

No description provided.

Signed-off-by: Vivek Khandelwal <[email protected]>

pashu123

LGTM! Are there any reasons why we aren't decomposing this torch op into another set of torch ops?

vivekkhandelwal1 · 2025-01-13T04:26:13Z

> LGTM! Are there any reasons why we aren't decomposing this torch op into another set of torch ops?

AFAIK, we don't decompose it because we want the attention op to remain a single kernel. Hence, we lower it directly to `tm_tensor.attention`, and the rest is handled during codegen.
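For readers unfamiliar with the flag, here is a minimal pure-Python sketch of what `enable_gqa` means semantically. It follows the reference behaviour described in the PyTorch documentation for `scaled_dot_product_attention`, where each key/value head is repeated (as with `repeat_interleave` along the head dimension) until the head count matches the query's. The function names `repeat_kv_heads`, `attention_one_head`, and `sdpa_gqa` are hypothetical and chosen for illustration; this is not the lowering added in this PR, and it ignores masks, dropout, causality, and batching.

```python
import math


def repeat_kv_heads(heads, num_q_heads):
    """Repeat each key/value head so the head count matches the query's."""
    num_kv_heads = len(heads)
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("query heads must be a multiple of key/value heads")
    group = num_q_heads // num_kv_heads
    # Same ordering as repeat_interleave along the head dimension:
    # [h0, h1] with group size 2 becomes [h0, h0, h1, h1].
    return [head for head in heads for _ in range(group)]


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_one_head(q, k, v):
    """Scaled dot-product attention for one head.

    q is [L][E], k is [S][E], v is [S][Ev]; the result is [L][Ev].
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for q_vec in q:
        scores = [scale * sum(a * b for a, b in zip(q_vec, k_vec)) for k_vec in k]
        weights = softmax(scores)
        out.append(
            [sum(w * v_vec[d] for w, v_vec in zip(weights, v)) for d in range(len(v[0]))]
        )
    return out


def sdpa_gqa(query, key, value):
    """Grouped-query attention: query is [Hq][L][E], key/value are [Hkv][S][...]."""
    key = repeat_kv_heads(key, len(query))
    value = repeat_kv_heads(value, len(query))
    return [attention_one_head(q, k, v) for q, k, v in zip(query, key, value)]
```

In this sketch, two query heads sharing one key/value head both attend over the same keys and values, which is the grouping that `enable_gqa` turns on. A fused kernel would presumably index the shared head directly rather than materialising the repeated copies, but that is an assumption about codegen, not something stated in this thread.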

[MLIR][TORCH] Add support for enable_gqa flag in SDPA op

c4b0c18

Signed-off-by: Vivek Khandelwal <[email protected]>

vivekkhandelwal1 requested review from rsuderman, AmosLewis, zjgarvey and pashu123 January 9, 2025 09:19

Add comment for reference implementation

26d246c

pashu123 approved these changes Jan 9, 2025

vivekkhandelwal1 requested a review from Groverkss January 16, 2025 13:27
