[ROCm] Stop hipifying native/cudnn/ and native/quantized/cudnn/ files #3196
Open
rkayaith wants to merge 1 commit into hipdnn_develop from
Conversation
These files are guarded by `#if AT_CUDNN_ENABLED()`, which is always 0 on ROCm, so only stub implementations compile. Hipify was making text substitutions (cudnnHandle_t → miopenHandle_t, etc.) to code that is entirely dead on ROCm.

Fixes needed to compile without hipify:

- RNN.cpp: move CUDAEvent.h, CUDAGraphsUtils.cuh, and Exceptions.h into the `#else // AT_CUDNN_ENABLED()` block (they are only used by the real implementation)
- LossCTC.cpp: remove the unused CUDAGraphsUtils.cuh include
- BatchNorm.cpp, Module.cpp, attention.cu, attention_backward.cu: remove the `#ifdef __HIP_PLATFORM_AMD__` guards that selected hipified header paths (cudnn/hip/MHA.h, cudnn/hip/BatchNorm.h) — use the originals directly, since hipify no longer runs on these files

The quantized/cudnn/ files additionally had redundant `#ifdef USE_CUDA` guards wrapping the entire file. These files are only compiled in CUDA/ROCm builds (gated by CMake), so the guards were dead code.

Authored with Claude.
This was referenced May 1, 2026
Jenkins build for fe50fbc0508906dcdcf0bd2b487580278c268a2d commit finished as FAILURE
zjgarvey reviewed May 4, 2026 and left a comment:
Some questions. Definitely want an eye from @jeffdaily on some of these changes, too.
zjgarvey approved these changes May 6, 2026 and left a comment:
Looks good to me, but might be good to have @jeffdaily take a look to see if there is anything that might be problematic about the hipify changes.
This PR adds hipDNN as a supported backend for SDPA.
The approach taken here is to re-use the existing `CUDNN_ATTENTION` backend, adding support for it by routing through hipDNN when compiled for ROCm; i.e. `torch.nn.attention.SDPBackend.CUDNN_ATTENTION` and `aten::_scaled_dot_product_cudnn_attention` now work on ROCm.

The primary change here is adding `cudnn/hip/MHA.cpp`, which is a "fork" of `cudnn/MHA.cpp` modified to use hipDNN instead of cuDNN. There are various differences between the implementations that made it simpler to fork the entire file rather than trying to keep both implementations in the same file or rely on hipify for translation:

Additionally, since hipDNN provides an API for querying engine support for a graph, during backend selection `sdp::can_use_cudnn_attention` calls the newly added `at::native::check_cudnn_sdpa_support`, which constructs the hipDNN graph for both forwards and backwards (if potentially needed) to query support. This is not cached, as it's assumed that construction + querying support is implemented efficiently. The sequence of hipDNN calls:

Both backends still share the same `attention.cu` kernel (hipDNN uses the hipified version).

This is separated into a stack of PRs for easier review:
PR 1/3: [ROCm] Stop hipifying native/cudnn/ and native/quantized/cudnn/ files (this PR)

ifdef'd out. This disables the hipification rules and fixes up includes/directives so the CUDA files compile cleanly on ROCm. The primary motivation here is to stop generating `cudnn/hip/MHA.cpp`; the following changes add this file back with the hipDNN backend implementation.

PR 2/3: [ROCm] Integrate hipDNN as an SDPA backend:

- Add aotriton.images/ to .gitignore (NFC change)
- Extract compute_matching_strides from alloc_with_matching_layout (NFC change)
- Split cudnn/MHA.cpp into separate cuDNN and hipDNN files
- Copy cuDNN MHA implementation verbatim into hipDNN MHA
- [ROCm] Add hipDNN SDPA backend dispatch
- Update tests to reflect the cudnn backend being available on ROCm; `CUDNN_ATTENTION` tests can now be run on ROCm

PR 3/3: [ROCm] Pass bool attention masks directly to hipDNN
Open questions for reviewers:

- Re-using the `CUDNN_ATTENTION` backend was done due to API similarities, though this could potentially be confusing. Would it be preferable to completely separate hipDNN by adding a new backend?
- […] `MHA.cpp` files be recommended?