
[PyTorch] Add ops for dropout and constant scale #1995


Merged
timmoon10 merged 4 commits into NVIDIA:main on Jul 25, 2025

Conversation

timmoon10 (Collaborator)

Description

This PR adds fusible ops for dropout (heavily based on torch.nn.Dropout) and for multiplying by a constant scalar. These implementations are not performant, but they provide an API that future custom implementations or fusions can target.
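
As a usage sketch only: assuming the new ops are exposed as Dropout and ConstantScale in transformer_engine.pytorch.ops alongside the existing fusible ops (names inferred from this PR's title, not confirmed here), they would compose through the fusible-ops Sequential container:

import torch
import transformer_engine.pytorch as te

# Fusible ops compose through te.ops.Sequential; adjacent ops can be
# replaced by fused kernels when a fused implementation is available.
model = te.ops.Sequential(
    te.ops.Dropout(0.1),        # assumed ctor argument: dropout probability
    te.ops.ConstantScale(2.0),  # assumed ctor argument: scale factor
)

x = torch.randn(32, 64, device="cuda")
y = model(x)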

Type of change

  • Documentation change (change only to the documentation, either a fix or new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

Changes

  • Add fusible ops for dropout and constant scale

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

timmoon10 requested a review from negvet on Jul 25, 2025 05:10
timmoon10 (Collaborator Author)

/te-ci pytorch


negvet (Collaborator) left a comment


LGTM

is_training = self.training
mask = None
if is_training:
    keep_prob = 1 - self.dropout_probability
negvet (Collaborator) commented on Jul 25, 2025

Handling this case similarly to torch.nn.Dropout:

Suggested change:

-keep_prob = 1 - self.dropout_probability
+if self.dropout_probability == 1:
+    mask = torch.zeros_like(input_)
+    out = mask
+else:
+    keep_prob = 1 - self.dropout_probability
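
For reference, torch.nn.Dropout zeroes the whole tensor at train time when p == 1, which is the behavior this suggestion reproduces:

import torch

drop = torch.nn.Dropout(p=1.0)
drop.train()  # dropout only applies in training mode
x = torch.randn(4)
print(drop(x))  # tensor([0., 0., 0., 0.]): every element is dropped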

timmoon10 (Collaborator Author)

The existing impl should handle this case correctly. We will also replace this mask-based impl soon, so no need to optimize aggressively.
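
For context, a minimal sketch of a generic mask-based (inverted) dropout of the kind being discussed; it is illustrative, not the PR's actual implementation, and the explicit p == 1 branch mirrors the torch.nn.Dropout edge case shown above:

import torch

def masked_dropout(input_: torch.Tensor, p: float, training: bool = True):
    # Inverted dropout: kept elements are scaled by 1/keep_prob so the
    # expected value of the output matches the input.
    if not training or p == 0:
        return input_, None
    if p == 1:
        # Every element is dropped; handled explicitly to avoid
        # dividing by keep_prob == 0.
        mask = torch.zeros_like(input_)
        return mask, mask
    keep_prob = 1 - p
    # Bernoulli 0/1 mask with P(keep) = keep_prob, pre-scaled by 1/keep_prob
    mask = torch.bernoulli(torch.full_like(input_, keep_prob)) / keep_prob
    return input_ * mask, mask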

timmoon10 merged commit c6c1f50 into NVIDIA:main on Jul 25, 2025
11 of 12 checks passed