
[#2188] Align test_workflow_ops_xpu tests with pytorch. #2893

Open
jmamzax wants to merge 5 commits into intel:main from
jmamzax:dev/jmamzax/issue-2188_fix_test_learnable_per_channel_cuda

Conversation


@jmamzax jmamzax commented Feb 16, 2026

Part of issue #2188. The tests test_learnable_forward_per_channel_cuda_xpu and test_learnable_backward_per_channel_cuda_xpu were updated to match the upstream PyTorch test cases.

test_learnable_forward_per_channel_cuda_xpu
test_learnable_backward_per_channel_cuda_xpu
@astachowiczhabana astachowiczhabana linked an issue Feb 24, 2026 that may be closed by this pull request
Copilot AI review requested due to automatic review settings March 2, 2026 11:04

Copilot AI left a comment


Pull request overview

Updates the XPU quantization workflow tests to match upstream PyTorch’s learnable per-channel fake-quant test cases, addressing failures tracked in #2188.

Changes:

  • Removed Hypothesis-driven input generation for the two learnable per-channel CUDA/XPU tests and replaced it with fixed shapes/axes.
  • Added dtype coverage for the learnable per-channel forward/backward tests (float32 and bfloat16).
  • Dropped the unused to_tensor import after refactoring the backward test setup.
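For context, per-channel fake quantization (the operation these tests exercise) applies, for each channel c with learnable scale s_c and zero point z_c: q = clamp(round(x / s_c + z_c), qmin, qmax), then dequantizes as (q - z_c) * s_c. A minimal pure-Python sketch of that math, kept torch-free; this is illustrative only and not the XPU kernel under test, which the tests reach through PyTorch's fake-quant ops on device tensors:

```python
def fake_quantize_per_channel(x, scales, zero_points, qmin=0, qmax=255):
    """Fake-quantize a list of channel rows, one (scale, zero_point) per row.

    Illustrative sketch of the quantize -> clamp -> dequantize round trip that
    the learnable per-channel tests compare against a reference implementation.
    """
    out = []
    for row, s, z in zip(x, scales, zero_points):
        q_row = []
        for v in row:
            q = round(v / s + z)           # quantize with this channel's params
            q = max(qmin, min(qmax, q))    # clamp to the quantized range
            q_row.append((q - z) * s)      # dequantize back to float
        out.append(q_row)
    return out
```

Values inside the representable range survive the round trip (up to rounding to the nearest quantization step), while out-of-range values saturate at qmin/qmax, which is exactly the behavior a per-channel test must check per channel.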
Comments suppressed due to low confidence (2)

test/xpu/quantization/core/test_workflow_ops_xpu.py:104

  • shape = (2, 1, 2, 10) with axis = 1 makes channel_size = X_base.size(axis) equal to 1, so this “per-channel” test only exercises the single-channel case and won’t catch channel-dependent bugs. Consider using a shape/axis combination where the selected dimension is > 1 (while still matching the intended PyTorch reference).
    shape = (2, 1, 2, 10)
    axis = 1

    for dtype in [torch.float32, torch.bfloat16]:
        X_base = torch.randn(shape, device="xpu").to(dtype)
        channel_size = X_base.size(axis)
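The reviewer's point can be checked without tensors: for this shape/axis pair the per-channel dimension has extent 1, so every "per-channel" parameter tensor degenerates to a single element. A trivial plain-Python sketch mirroring `X_base.size(axis)`:

```python
# Shape/axis combination from the test (illustrative; the real test
# operates on torch tensors allocated on the XPU device).
shape = (2, 1, 2, 10)
axis = 1

channel_size = shape[axis]  # mirrors X_base.size(axis)
print(channel_size)         # 1 -> only the single-channel case is covered

# An axis with extent > 1 would exercise channel-dependent code paths:
print(shape[3])             # 10 channels along the last dimension
```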

test/xpu/quantization/core/test_workflow_ops_xpu.py:106

  • Inside the dtype loop, torch.randn(...).to(dtype) (and the subsequent .to(dtype) conversions) introduces extra allocations/copies on XPU. Prefer creating tensors with the target dtype directly (e.g., pass dtype= to the factory functions) to keep the test lighter and reduce overhead.
    for dtype in [torch.float32, torch.bfloat16]:
        X_base = torch.randn(shape, device="xpu").to(dtype)
        channel_size = X_base.size(axis)
        scale_base = (
            torch.normal(mean=0, std=1, size=(channel_size,)).clamp(1e-4, 100).to(dtype)


Copilot AI review requested due to automatic review settings March 6, 2026 07:34

Copilot AI left a comment


Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.



Comment on lines +105 to +108
        scale_base = (
            torch.normal(mean=0, std=1, size=(channel_size,)).clamp(1e-4, 100).to(dtype)
        )
        zero_point_base = torch.normal(mean=0, std=128, size=(channel_size,)).to(dtype)

Copilot AI Mar 6, 2026


In _test_learnable_forward_per_channel_cuda, scale_base and zero_point_base are created on the CPU and only cast to dtype. Since X_base is on XPU, this introduces host→device transfers (or potential device-mismatch issues if the downstream helper doesn’t move them). Consider creating these tensors directly on device='xpu' (and with the target dtype at creation) to keep all inputs on the same device and avoid extra copies.

Suggested change
        scale_base = (
            torch.normal(mean=0, std=1, size=(channel_size,)).clamp(1e-4, 100).to(dtype)
        )
        zero_point_base = torch.normal(mean=0, std=128, size=(channel_size,)).to(dtype)
        scale_base = torch.normal(
            mean=0,
            std=1,
            size=(channel_size,),
            device="xpu",
            dtype=dtype,
        ).clamp(1e-4, 100)
        zero_point_base = torch.normal(
            mean=0,
            std=128,
            size=(channel_size,),
            device="xpu",
            dtype=dtype,
        )



Development

Successfully merging this pull request may close these issues.

[Bug Skip]: new failures in 2025-10-17

4 participants