Conversation

@enlupi enlupi (Contributor) commented Aug 14, 2025

Description

This draft PR is intended to start a discussion on how best to integrate the PQuant🌶️ library with hls4ml.
The PR currently supports layers pruned and quantized with fixed quantizers; support for high-granularity quantization is still in development.

⚠️⚠️ In order for the PQuant layers to be parsed correctly, the user is currently expected to run some custom functions that add the quantization parameters as layer attributes (see the test file for examples). In the future, these functions should be added to PQuant🌶️. ⚠️⚠️
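For illustration, a minimal sketch of what such a pre-conversion step could look like. The helper name and the way the parameters are looked up are assumptions, not the actual PQuant🌶️ or test-file API:

def attach_quantization_parameters(model, quantization_config):
    # Hypothetical helper: store each layer's quantization settings as a plain
    # attribute so the hls4ml handlers can find them via
    # hasattr(layer, 'quantization_parameters'). Names are illustrative only.
    for layer in model.layers:
        params = quantization_config.get(layer.name)  # assumed lookup by layer name
        if params is not None:
            layer.quantization_parameters = params
    return model

# Usage sketch (see test/pytest/test_pquant.py for the real sequence):
# model = attach_quantization_parameters(model, quantization_config)
# hls_model = hls4ml.converters.convert_from_keras_model(model, hls_config=config)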

Type of change

  • New feature (non-breaking change which adds functionality)

Tests

The unit test test/pytest/test_pquant.py was added to check that pruned/quantized layers are parsed correctly and that the precision is set correctly.
⚠️⚠️ THE TEST IS CURRENTLY BROKEN! The file needs to be split into two, to test the PyTorch and Keras frontends separately. ⚠️⚠️

Checklist

  • I have read the guidelines for contributing.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have installed and run pre-commit on the files I edited or added.
  • I have added tests that prove my fix is effective or that my feature works.

@calad0i calad0i (Contributor) left a comment

Maybe also check out #1338 so that the bit-exact pass also applies to pquant models. In short, if the quantizers are converted to FixedPointQuantizer objects, or to linear layers with the quantization precision set properly and trusted=True, full model precision propagation (this file) can be used with few changes, and all precision inference should work (rules will need to be added for the multiplier relu function, but likely nothing else).
If you don't want to use that pass, maybe consider deciding whether users should use config_from_keras_model or not. If not, maybe leave it untouched and do the enforcement in a dedicated pass.
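If the dedicated-pass route is taken, here is a rough sketch of what the enforcement could look like; the class and the translation step are assumptions, and only the OptimizerPass match/transform interface follows the existing hls4ml passes:

from hls4ml.model.optimizer import OptimizerPass

class EnforcePQuantPrecision(OptimizerPass):
    # Hypothetical pass: turn the embedded PQuant quantization parameters into
    # hls4ml precision settings instead of relying on config_from_*_model.
    def match(self, node):
        return node.get_attr('quantization_parameters') is not None

    def transform(self, model, node):
        qparams = node.get_attr('quantization_parameters')
        # Translate qparams into FixedPrecisionType objects and set them on the
        # node's weight/result types here; the details depend on the PQuant format.
        ...
        return False  # the graph structure is not modified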

@@ -33,6 +33,10 @@ def handle(
    'n_out': n_out,
    'n_in': n_in,
}

if hasattr(layer, 'quantization_parameters'):
    config['quantization_parameters'] = layer.quantization_parameters
Contributor

This is not part of the standard Keras 3 properties; it should be handled in your pquant handler (maybe a base class for all pquant layers?).
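A minimal sketch of that suggestion, assuming a hypothetical shared base/mixin for the PQuant handlers (the name is illustrative):

class PQuantHandlerMixin:
    # Hypothetical shared base for the PQuant handlers: keeps the attribute check
    # in one place instead of repeating it in every handler.
    def add_quantization_parameters(self, layer, config):
        # Only layers prepared by the PQuant helper carry this attribute;
        # plain Keras 3 layers do not.
        if hasattr(layer, 'quantization_parameters'):
            config['quantization_parameters'] = layer.quantization_parameters
        return config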

config['shift'] = 0.5
# Quartus seems to have trouble if the width is 1.
config['slope_prec'] = FixedPrecisionType(width=2, integer=0, signed=False)
config['shift_prec'] = FixedPrecisionType(width=2, integer=0, signed=False)
Contributor

width should be 1 here



@register
class PQuantPoolingHandler(KerasV3LayerHandler):
Contributor

This looks mostly the same as the original pooling handler; I think some code from there could be reused?
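One possible shape for that reuse, assuming the stock pooling handler can be subclassed; the PoolingHandler name and the handle signature are assumptions:

@register
class PQuantPoolingHandler(PoolingHandler):  # hypothetical: inherit the regular pooling parsing
    def handle(self, layer, *args, **kwargs):
        config = super().handle(layer, *args, **kwargs)
        # Add only the PQuant-specific part on top of the regular pooling config.
        if hasattr(layer, 'quantization_parameters'):
            config['quantization_parameters'] = layer.quantization_parameters
        return config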

@@ -141,4 +141,7 @@ def handle(
elif isinstance(layer, BaseConv):
    config['weight_data'] = kernel

if hasattr(layer, 'quantization_parameters'):
    config['quantization_parameters'] = layer.quantization_parameters
Contributor

Same as above: this should be in the pquant base class.

@@ -44,6 +44,10 @@ def parse_conv1d_layer(operation, layer_name, input_names, input_shapes, node, c

output_shape = [input_shapes[0][0], layer['n_filt'], layer['out_width']] # Channel first as default

# Quantization parameter for PQuant integration
if hasattr(class_object, "quantization_parameters"):
    layer['quantization_parameters'] = class_object.quantization_parameters
Contributor

As the handler is functional you can't really subclass it, so this is fine. Though, all the quantization parameters saved here can be derived from the values themselves, so this is not strictly necessary...
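To illustrate the redundancy: for fixed-point quantized values, the precision can be recovered from the values alone, roughly as below (a standalone sketch, not the actual hls4ml precision-inference pass; sign handling is omitted):

import numpy as np

def infer_fixed_precision(values, max_frac_bits=32):
    # Recover (integer_bits, fractional_bits) that represent `values` exactly.
    v = np.asarray(values, dtype=np.float64)
    # Smallest number of fractional bits for which every value is a multiple of 2**-frac.
    frac = 0
    while frac < max_frac_bits and not np.all(v * 2**frac == np.round(v * 2**frac)):
        frac += 1
    top = float(np.max(np.abs(v))) if v.size else 0.0
    # Bits needed for the integer part of the largest magnitude (sign bit not counted).
    integer = int(np.ceil(np.log2(np.floor(top) + 1))) if top >= 1 else 0
    return integer, frac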

@@ -352,7 +352,10 @@ def parse_pytorch_model(config, verbose=True):
if '.' not in node.target:
    obj = getattr(model, node.name)
else:
    obj = getattr(children[node.target.split('.')[0]], node.name)
if '_' not in node.name:
Contributor

Please add a comment explaining why this is needed

if (datareg > 0) {

    if (mul[0] >= 0)
        res[ii] = datareg << mul[0];
Contributor

<< -3 is legal syntax. Why differentiate +/- cases?

# TODO In the next version of this function, these should not be exposed to user to tweak
layer_config['Precision'][pname] = str(precision)
# PQuant quantization
if 'quantization_parameters' in layer:
Contributor

I am a bit confused here. There are three ways to handle quantization parameters:

  • QKeras: the parser says almost nothing about the precision used; the quantization configuration is passed through the config generated by config_from_model.
  • (old) HGQ1: an optimizer pass enforces the quantization config embedded in the model; by default config_from_model does nothing and is not expected to be used.
  • (current) HGQ1/2: a pass infers the layer precision config from the weight/activation quantizers without them being passed explicitly; config_from_model still does nothing and is not expected to be used.

Since you are embedding the config into the parsed layer dictionary, I am assuming you are going with way 2, but then why add config_from_model precision here? Are users expected to use config_from_model when converting pquant models? If the two conflict after user modification, which takes priority?
