
Conversation

sayakpaul
Member

What does this PR do?

Refactors how we load the attention kernels from the Hub.

Currently, when a user sets the DIFFUSERS_ENABLE_HUB_KERNELS env var, we always download every supported kernel. Today that is only FA3, but there are ongoing PRs that add FA and SAGE support: #12387 and #12439. Once those land, we would download ALL of the kernels even when they're not required. This is not good.

This PR makes it so that only the kernel for the requested attention backend gets downloaded, without breaking torch.compile compliance (fullgraph compilation and no recompilation triggers).
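
Roughly, the idea is to keep a small mapping from backend name to a "preparer" callable and only invoke the preparer for the backend that is actually selected, in eager Python before the attention op is traced. A minimal sketch of the pattern (the enum below is a trimmed stand-in for the real AttentionBackendName, and the preparer body is elided; see the review discussion below for what it does):

import enum

class AttentionBackendName(str, enum.Enum):
    # trimmed stand-in for the real enum in diffusers' attention dispatch code
    FLASH = "flash"
    _FLASH_3_HUB = "_flash_3_hub"

def _ensure_flash_attn_3_func_hub_loaded() -> None:
    # lazily downloads/caches the FA3 kernel from the Hub on first use
    ...

# Only Hub-backed backends need an entry; everything else has nothing to prepare.
_BACKEND_PREPARERS = {
    AttentionBackendName._FLASH_3_HUB: _ensure_flash_attn_3_func_hub_loaded,
}

def _prepare_attention_backend(backend: AttentionBackendName) -> None:
    preparer = _BACKEND_PREPARERS.get(backend)
    if preparer is not None:
        # Runs outside the compiled region, so fullgraph compilation is preserved
        # and no recompilation is triggered.
        preparer()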

Cc: @MekkCyber

sayakpaul requested a review from DN6 · October 13, 2025, 10:14
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.


MekkCyber left a comment


Very good to only load the invoked attention implementation! Thanks for adding this.

Comment on lines +467 to +471
def _prepare_attention_backend(backend: AttentionBackendName) -> None:
    preparer = _BACKEND_PREPARERS.get(backend)
    if preparer is not None:
        preparer()


I'm not sure I understand what the preparer does

sayakpaul
Member Author


AttentionBackendName._FLASH_3_HUB: _ensure_flash_attn_3_func_hub_loaded,
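
The preparer is just a small function that lazily fetches (and caches) its kernel from the Hub the first time that backend is requested, so unused kernels are never downloaded. Roughly, assuming the kernels package's get_kernel loader and an illustrative repo id, it looks like:

from kernels import get_kernel

_flash_attn_3_hub_kernel = None

def _ensure_flash_attn_3_func_hub_loaded() -> None:
    # Download (or reuse the locally cached copy of) the FA3 kernel only when
    # the _FLASH_3_HUB backend is actually dispatched.
    global _flash_attn_3_hub_kernel
    if _flash_attn_3_hub_kernel is None:
        _flash_attn_3_hub_kernel = get_kernel("kernels-community/flash-attn3")  # repo id is illustrative

Backends that need no setup simply have no entry in _BACKEND_PREPARERS, so nothing is downloaded for them.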


Sounds good

