add: detect Q/DQ with int16/uint16 initializers for GPU Scale Transform Pass #768
base: ovep-develop
Conversation
@mklimenk Please test, review & merge
Please add tests for the IsQDQGraphWithUint16OrInt16() function to make sure that we cover all the cases. Also, please remove the excessive comments; there's no need to be that explicit.
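For illustration only, a minimal gtest sketch of the kind of coverage requested, assuming the lambda below is exposed as a free helper Is16BitTensor (a hypothetical name; a standalone version is sketched later in this thread). NodeArg construction follows onnxruntime's (name, TypeProto*) constructor.

```cpp
// Hypothetical test sketch (gtest): Is16BitTensor and the suite name are
// illustrative, not the PR's actual code.
#include "gtest/gtest.h"
#include "core/graph/graph.h"

namespace {
// Build a TypeProto describing a tensor of the given element type.
ONNX_NAMESPACE::TypeProto MakeTensorType(int32_t elem_type) {
  ONNX_NAMESPACE::TypeProto type_proto;
  type_proto.mutable_tensor_type()->set_elem_type(elem_type);
  return type_proto;
}
}  // namespace

TEST(GpuScaleTransformTest, Is16BitTensorCoversAllCases) {
  auto u16 = MakeTensorType(ONNX_NAMESPACE::TensorProto_DataType_UINT16);
  auto i16 = MakeTensorType(ONNX_NAMESPACE::TensorProto_DataType_INT16);
  auto u8  = MakeTensorType(ONNX_NAMESPACE::TensorProto_DataType_UINT8);

  onnxruntime::NodeArg u16_arg("u16", &u16);
  onnxruntime::NodeArg i16_arg("i16", &i16);
  onnxruntime::NodeArg u8_arg("u8", &u8);
  onnxruntime::NodeArg untyped_arg("untyped", nullptr);

  EXPECT_TRUE(Is16BitTensor(&u16_arg));       // 16-bit unsigned detected
  EXPECT_TRUE(Is16BitTensor(&i16_arg));       // 16-bit signed detected
  EXPECT_FALSE(Is16BitTensor(&u8_arg));       // other widths rejected
  EXPECT_FALSE(Is16BitTensor(&untyped_arg));  // missing type info rejected
  EXPECT_FALSE(Is16BitTensor(nullptr));       // null arg rejected
}
```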
```cpp
auto is_16bit_tensor = [](const onnxruntime::NodeArg* node_arg) -> bool {
  if (!node_arg) return false;
  const auto* type_proto = node_arg->TypeAsProto();
  if (type_proto && type_proto->has_tensor_type()) {
    auto elem_type = type_proto->tensor_type().elem_type();
    return (elem_type == ONNX_NAMESPACE::TensorProto_DataType_UINT16 ||
            elem_type == ONNX_NAMESPACE::TensorProto_DataType_INT16);
  }
  return false;
};
```
Please move it to a separate function; there's no need for long multi-line lambdas.
fixed
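For reference, a minimal sketch of the lambda lifted into a standalone function; the name Is16BitTensor is an assumption, and the body mirrors the lambda quoted above.

```cpp
// Sketch of the lambda as a standalone helper (hypothetical name): returns
// true iff the NodeArg carries a tensor type of UINT16 or INT16.
static bool Is16BitTensor(const onnxruntime::NodeArg* node_arg) {
  if (!node_arg) return false;
  const auto* type_proto = node_arg->TypeAsProto();
  if (type_proto && type_proto->has_tensor_type()) {
    const auto elem_type = type_proto->tensor_type().elem_type();
    return elem_type == ONNX_NAMESPACE::TensorProto_DataType_UINT16 ||
           elem_type == ONNX_NAMESPACE::TensorProto_DataType_INT16;
  }
  return false;
}
```

A named free function also gives the predicate a seam for the unit tests requested above.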
```cpp
// QuantizeLinear: [float_input, scale, zero_point] -> [quantized_output]
// The quantized output tensor determines the quantization type
```
Please remove identical comments
fixed
```cpp
// Zero point (index 2) must match quantized tensor type per ONNX spec
// It's optional - absent for INT32 and some float8 types
if (input_defs.size() >= 3 && is_16bit_tensor(input_defs[2])) {
```
Should it be output_defs[2], like the corresponding portion in the previous condition?
Yes, it's the zero_point dtype that is tested to check for the INT16/UINT16 dtype.
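For context, the ONNX spec defines QuantizeLinear's inputs as [x, y_scale, y_zero_point], and the optional zero point shares the element type of the quantized output, so testing it is equivalent to testing the output dtype whenever it is present. A hedged sketch of the check, with node and helper names assumed from the snippets above:

```cpp
// Per the ONNX spec, QuantizeLinear inputs are [x, y_scale, y_zero_point].
// The optional zero point (index 2) shares the quantized output's element
// type, so its dtype is a valid proxy for the quantization type when present.
const auto& input_defs = node.InputDefs();  // `node` assumed from context
if (input_defs.size() >= 3 && Is16BitTensor(input_defs[2])) {
  return true;  // this Q/DQ node quantizes to INT16/UINT16
}
```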
fbf966a to 6ceb8e7
Description
This PR enables the GPU Scale Transform Pass by detecting UINT16 and INT16 initializer types in the Q/DQ nodes of the graph. This decouples the pass from the enable_qdq_optimizer provider option used in the legacy OVEP code.
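A rough sketch of the detection this description refers to; the function signature and names are assumptions pieced together from the review thread, not the PR's exact code:

```cpp
// Hypothetical sketch: scan the graph for Q/DQ nodes whose zero-point input
// is a 16-bit integer tensor, reusing the Is16BitTensor helper sketched above.
#include "core/graph/graph_viewer.h"

static bool IsQDQGraphWithUint16OrInt16(const onnxruntime::GraphViewer& graph_viewer) {
  for (const auto& node : graph_viewer.Nodes()) {
    if (node.OpType() != "QuantizeLinear" && node.OpType() != "DequantizeLinear")
      continue;
    const auto& input_defs = node.InputDefs();
    // Zero point (index 2) carries the quantized element type when present.
    if (input_defs.size() >= 3 && Is16BitTensor(input_defs[2]))
      return true;
  }
  return false;
}
```

If the detection has this shape, the pass can be gated purely by graph inspection, with no dependency on the enable_qdq_optimizer provider option.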