[PyTorch Debug] Support precision debug tools for fp8 model parameters. #2141

pggPL · 2025-09-01T10:32:58Z

Description

Currently precision debug tools are not supported for FP8 model parameters. It is because all logic of debug tools is inside quantize() function in DebugQuantizers, which are not called if weight is in FP8. Also, for some stats like number of underflows we need both high precision tensor and quantized tensor.

I added function DebugQunatizer.wrap_quantized_tensor(QuantizedTensor) -> DebugQuantizedTensor which will be called for debug iterations for weight. The debug for all the other tensors work without any change.

I made argument tensor for inspect_tensor call optional - it is None for weight tensor in case of fp8 model parameters.
If one wants to use LogTensorStats, the quantized tensor is dequantized. For LogFp8TensorStats the stats which needs high precision tensor are disabled in considered case.

Fixes #2140

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactoring

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: Pawel Gadzinski <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Pawel Gadzinski <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Pawel Gadzinski <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Pawel Gadzinski <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Pawel Gadzinski <[email protected]>

pggPL · 2025-09-01T13:06:51Z

/te-ci pytorch

for more information, see https://pre-commit.ci

Signed-off-by: Pawel Gadzinski <[email protected]>

pggPL · 2025-09-15T08:53:05Z

/te-ci pytorch

pggPL · 2025-09-15T11:08:13Z

/te-ci pytorch

pggPL and others added 9 commits September 1, 2025 10:17

initial code drop

10fdc46

Signed-off-by: Pawel Gadzinski <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

2e5debb

for more information, see https://pre-commit.ci

fixes

11b4c26

Signed-off-by: Pawel Gadzinski <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

8a8eed3

for more information, see https://pre-commit.ci

fix

ecdc727

Signed-off-by: Pawel Gadzinski <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

b0162af

for more information, see https://pre-commit.ci

fixes

20515c7

Signed-off-by: Pawel Gadzinski <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

26926cc

for more information, see https://pre-commit.ci

fix

39d500a

Signed-off-by: Pawel Gadzinski <[email protected]>

pggPL marked this pull request as ready for review September 1, 2025 13:06

pre-commit-ci bot and others added 3 commits September 1, 2025 13:06

[pre-commit.ci] auto fixes from pre-commit.com hooks

206cbc6

for more information, see https://pre-commit.ci

fix

82e3f2c

Signed-off-by: Pawel Gadzinski <[email protected]>

Merge branch 'main' into nvinspect_fp8_model_weights

ef7ef41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[PyTorch Debug] Support precision debug tools for fp8 model parameters. #2141

[PyTorch Debug] Support precision debug tools for fp8 model parameters. #2141

Uh oh!

pggPL commented Sep 1, 2025

Uh oh!

pggPL commented Sep 1, 2025

Uh oh!

pggPL commented Sep 15, 2025

Uh oh!

pggPL commented Sep 15, 2025

Uh oh!

Uh oh!

[PyTorch Debug] Support precision debug tools for fp8 model parameters. #2141

Are you sure you want to change the base?

[PyTorch Debug] Support precision debug tools for fp8 model parameters. #2141

Uh oh!

Conversation

pggPL commented Sep 1, 2025

Description

Type of change

Checklist:

Uh oh!

pggPL commented Sep 1, 2025

Uh oh!

pggPL commented Sep 15, 2025

Uh oh!

pggPL commented Sep 15, 2025

Uh oh!

Uh oh!