[FEATURE] Lucene Inbuilt Scalar Quantizer to convert float 32 bits to 4 bits #2252

naveentatikonda · 2024-11-07T03:24:38Z

Description

Since OpenSearch 2.17 we have support for Lucene Inbuilt Scalar Quantizer which accepts fp32 vectors as input and dynamically quantizes the data into int7 ranging from [0 to 127] providing 4x compression. Adding support for 4 bits to the Lucene SQ provides 8x compression which helps to quantize fp32 vectors into int4 ranging from [0 to 15], which helps to further reduce the memory requirements by trading off recall.

naveentatikonda added the v2.19.0 label Nov 7, 2024

naveentatikonda self-assigned this Nov 7, 2024

naveentatikonda added this to Vector Search RoadMap Nov 7, 2024

github-project-automation bot moved this to Backlog in Vector Search RoadMap Nov 7, 2024

naveentatikonda moved this from Backlog to 2.19.0 in Vector Search RoadMap Nov 7, 2024

github-actions bot added the untriaged label Nov 7, 2024

naveentatikonda removed the untriaged label Nov 7, 2024

This was referenced Nov 7, 2024

[DOC] Lucene Inbuilt Scalar Quantizer to quantize float 32 bits to 4 bits opensearch-project/documentation-website#8689

Open

Add support for Lucene int4 SQ #2253

Open

navneet1v added the memory-reduction label Nov 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Lucene Inbuilt Scalar Quantizer to convert float 32 bits to 4 bits #2252

[FEATURE] Lucene Inbuilt Scalar Quantizer to convert float 32 bits to 4 bits #2252

naveentatikonda commented Nov 7, 2024

[FEATURE] Lucene Inbuilt Scalar Quantizer to convert float 32 bits to 4 bits #2252

[FEATURE] Lucene Inbuilt Scalar Quantizer to convert float 32 bits to 4 bits #2252

Comments

naveentatikonda commented Nov 7, 2024

Description