Propose vector-set API #2939

mgravell · 2025-08-11T13:51:04Z

as per https://redis.io/docs/latest/develop/data-types/vector-sets/

all methods/types start VectorSet...
all core methods implemented
the only "unusual" one is VLINKS; the server returns this as nested data, but the nesting is not meaningful to the caller (instead being related to the server core), so I've flattened the result
usage is shown in VectorSetIntegrationTests, in particular VectorSetSimilaritySearch_WithFilter is useful for overview - since the main two primary APIs are VADD and VSIM
since vector set data can be non-trivial, all return types lean on Lease<T> rather than T[]. Inputs are ReadOnlyMemory<T>; the exception to this is string data for JSON and filters; these are explicitly text, never blobs, so RedisValue seems inappropriate. I think forcing string is OK here
in acknowledgement that some of our files are too large, I've started splitting the VectorSet* bits out via partial files; I will follow this up with "everything else" (.Strings.cs, .Hashes.cs) in a separate PR

This PR also introduces the start of a new literal-matching API, ala FastHash. It is intended that this will be extended at a later date.

CI: may be dependent on the vectorset module; if it fails, I'll add suitable validation. All VectorSetTests pass locally:

mgravell · 2025-08-14T20:36:28Z

Additional thoughts on FastHash:

if length <= 8, we can skip equality test
consider hashing first 16 (status: considered and deferred)
add unit test that shows interesting known values of different lengths (and different values at same length)

- misc FastHash core improvements

src/StackExchange.Redis/VectorSetSimilaritySearchMessage.cs

tests/StackExchange.Redis.Tests/VectorSetIntegrationTests.cs

mgravell · 2025-09-04T16:02:21Z

nits and "avoid the massive number of parameters" type decl: d53f1ab

mgravell · 2025-09-04T16:03:00Z

badrishc · 2025-09-04T21:37:22Z

Brilliant, glad to see this landing as we look into vector sets in Garnet as well.

mgravell · 2025-09-05T15:10:45Z

@NickCraver @philon-msft bump

kevin-montrose · 2025-09-05T18:03:09Z

Brilliant, glad to see this landing as we look into vector sets in Garnet as well.

In this vein, as I'm working on Garnet's Vector Set api support, I want to have a discussion about likely extensions.

It seems quite possible that Redis will add new quantization options (to join NOQUANT, B8, and BIN). Garnet is definitely going to have a few (I've sketched out a XPREQ8, though that name is a place holder, for "NOQUANT but unsigned bytes"), and I expect more to come.

It also seems reasonable for there to be new vector data formats for the commands (to join FP32 and VALUES), given how common reduced precision storage is in ML workloads (FP16 and FP8 for example). Garnet is going to have at least one (which I'm calling XB8 right now, for "each dimension is one byte, [0, 255]").

Obviously all these extensions could be handled with the .ExecuteXXX methods just fine. But I wonder, given the likelihood of extensions from both Redis proper and other RESP implementers, if a something for these specific options might be merited.

Just spit-balling, but a struct (which wraps some Span<byte>/Memory<byte>) instead of an enum for VectorSetQuantization and a new type for values rather than a simple ReadOnlyMemory<float> with an accompanying Span<byte>/Memory<byte> for the flag. With some conversions, this could also accommodate users who have their vectors elements as strings w/o forcing them to parse them client side before sending.

/cc @NickCraver Since we discuss this a little bit this morning.

mgravell · 2025-09-05T19:43:17Z

Sounds interesting. We've moved to an args parameter for that API, so one option here might be to, at some future date, unseal it with a view to using some polymorphic "the subclass can fill this buffer with what it wants" API. I think this gives us any wriggle room for future changes. Thoughts?

In particular, I wonder if I move the existing "by vector" bits into a subclass, presumably with a suitable "fill this" API.

kevin-montrose · 2025-09-05T21:09:04Z

Lemme see if I follow.

Current API for VADD is:

bool VectorSetAdd(
        RedisKey key,
        RedisValue element,
        ReadOnlyMemory<float> values,
        int? reducedDimensions = null,
        VectorSetQuantization quantization = VectorSetQuantization.Int8,
        int? buildExplorationFactor = null,
        int? maxConnections = null,
        bool useCheckAndSet = false,
        string? attributesJson = null,
        CommandFlags flags = CommandFlags.None);

public enum VectorSetQuantization
{
    Unknown,
    None,
    Int8,
    Binary,
}

So we'd change (or add as a new overload) that to something like:

bool VectorSetAdd<TVectorData>(
        RedisKey key,
        RedisValue element,
        TVectorData values,
        int? reducedDimensions = null,
        VectorSetQuantization quantization = VectorSetQuantization.Int8,
        int? buildExplorationFactor = null,
        int? maxConnections = null,
        bool useCheckAndSet = false,
        string? attributesJson = null,
        CommandFlags flags = CommandFlags.None)
where TVectorData: IVectorData;

public readonly struct VectorSetQuantization(ReadOnlyMemory<byte> Name)
{
  public static readonly VectorSetQuantization Unknown = new(default);
  public static readonly VectorSetQuantization None = new("NOQUANT"u8);
  public static readonly VectorSetQuantization Int8 = new("Q8"u8);
  public static readonly VectorSetQuantization Binary = new("BIN"u8);
}

public interface IVectorData
{
  // Some sort of write callback here?
}

That makes sense to me.

mgravell · 2025-09-06T07:36:08Z

Sorry, I was only thinking about VSIM - my bad. It would seem desirable to unify the approach between VSIM and VADD, and maybe marry the data and quant encoding (understanding that there is some overlap). I wonder if we should take a leaf from System.Text.Encoding here. Suggestion:

// Interprets a single T as N elements. T is most likely
// ROM-float, ROM-double, or similar; but is determined
// by the implementation
public abstract class VectorEncoding<TVector>
{
    public abstract ReadOnlySpan<byte> DataEncoding {get;} // "FP32"u8
    public abstract ReadOnlySpan<byte> Quantization {get;} // "INT8"u8
    // note: if -ve reply from TryGetByteCount, VALUES is used instead
    public abstract int TryGetByteCount(in TVector vector); // 4x.Length
    public abstract void GetBytes(in TVector vector, Span<byte> buffer); // Unsafe.Cast ... CopyTo

    // used for VALUES encoding
    public abstract int GetElementCount(in TVector vector); // .Length
    public abstract double GetElement(in TVector vector, int index); // [index]
}

Where both VSIM and VADD could take a TVector and a VectorEncoding-TVector

Thoughts?

mgravell · 2025-09-06T07:39:11Z

Or are the quantization and encoding entirely independent? They don't feel entirely independent....

kevin-montrose · 2025-09-07T18:37:33Z

Or are the quantization and encoding entirely independent? They don't feel entirely independent....

They are, though the feel is weird I agree.

Like, NOQUANT + XB8 still stores floats in the vector set in this scheme, while FP32 + XPREQ stores bytes. The data encoding is transient.

Some of this confusion is because NOQUANT doesn't mention F32, even though that is what is actually means.

mgravell · 2025-09-07T21:01:14Z

OK. Right. I'll see if I can keep the two concepts separate, then. But there's definitely facility in abstraction - no point sending FP32 queries or data-loads to data that is quantized to FP8, so I can easily imagine that you'd allow an efficient transport syntax

mgravell · 2025-09-09T09:02:11Z

@kevin-montrose I looked at this this morning. I think there's a very strong likelihood of over-designing if we don't have concrete requirements, and in any event: we'd probably still want the default and most obvious API / overload to be the existing one. I think, for now, we can do a minimal change here that keeps the door open:

make VectorSetSimilaritySearchRequest abstract
create two internal versions - one for "by member", one for "by vector, FP32"
create two static methods on VSSSR

so:

- [SER001]StackExchange.Redis.VectorSetSimilaritySearchRequest.Member.get -> StackExchange.Redis.RedisValue
- [SER001]StackExchange.Redis.VectorSetSimilaritySearchRequest.Member.set -> void
- [SER001]StackExchange.Redis.VectorSetSimilaritySearchRequest.Vector.get -> System.ReadOnlyMemory<float>
- [SER001]StackExchange.Redis.VectorSetSimilaritySearchRequest.Vector.set -> void
- [SER001]StackExchange.Redis.VectorSetSimilaritySearchRequest.VectorSetSimilaritySearchRequest() -> void
+ [SER001]static StackExchange.Redis.VectorSetSimilaritySearchRequest.ByMember(StackExchange.Redis.RedisValue member) -> StackExchange.Redis.VectorSetSimilaritySearchRequest!
+ [SER001]static StackExchange.Redis.VectorSetSimilaritySearchRequest.ByVector(System.ReadOnlyMemory<float> vector) -> StackExchange.Redis.VectorSetSimilaritySearchRequest!

This means that in the future, we can add whatever other factory methods we need to support additional encodings, and we haven't forced VectorSetSimilaritySearchRequest.Vector to be a fixed type.

On the "add" side: we can just overload trivially.

I'm adding this as a commit - feedback encouraged before I merge ;p

(the actual API to allow extensibility is internal for now - that's fine; all are options are open)

- make VectorSetSimilaritySearchRequest abstract - remove Member and Vector - add ByMember and ByVector factory methods - (internal changes to support the above, in the message etc)

mgravell · 2025-09-09T09:15:41Z

API change formalized ^^^

kevin-montrose · 2025-09-09T14:39:07Z

I'm fine being conservative until we have the final Garnet (or others) implementation to compare against (and obviously, consult about), and that seems reasonable.

Though this still leaves the question of VADD extensions unaddressed, yes?

NickCraver · 2025-09-09T15:11:59Z

src/StackExchange.Redis/VectorSetQuantization.cs

+    /// <summary>
+    /// Binary quantization. This maps to "BIN" or "bin".
+    /// </summary>
+    Binary,


Just in case, let's add int values to this

NickCraver

Discussed doing an object for VectorSetAdd for future additions, otherwise looking good 👍

mgravell · 2025-09-09T16:31:58Z

@kevin-montrose see Nick's comment above: working exactly that

Iniial stab at vector-set API

a2be468

mgravell marked this pull request as draft August 11, 2025 13:51

mgravell added 13 commits August 11, 2025 14:55

Use bool as the return from VADD

e8b580e

working on impl

a7a2f6a

more tests

9a4b9c3

ack experimental

ebce7eb

ack experimental

579133b

links

f20db67

VLINKS impl

4b4225e

implement VSIM message

e0b959d

fixins

f755e7a

core for fast-hash

0ed581e

VINFO complete

d69145d

VSIM; watch out for RESP3+WITHSCORES+WITHATTRIBS, that's a doozy!

8933579

VSIM filter integration tests

828b408

mgravell marked this pull request as ready for review August 13, 2025 15:57

mgravell added 4 commits August 13, 2025 17:04

allow VectorSet as a method prefix (CheckSignatures)

697d312

Split VectorSet* code into partial files

a148d99

Split VectorSet code from ResultProcessor (also: *Lease*)

337cbba

key can be embstr or raw

d82c9a4

mgravell added ⚙️ area:commands ⚙️ area:API 👀 for-review ⚙️ area:vectors labels Aug 14, 2025

mgravell added 6 commits August 14, 2025 16:28

Use code-generator for [FastHash].

6320185

Move the literals to be better scoped.

b14934d

tyop

26a4a62

lost a using directive somehow

f9d3af9

disable spell-checker on literals

51f93c6

literals can be private

c421ccb

mgravell added 2 commits August 15, 2025 11:29

- add FastHashTests

536efe4

- misc FastHash core improvements

Merge branch 'main' into marc/vectorsets

65eb889

mgravell requested review from philon-msft and NickCraver August 19, 2025 15:04

NickCraver reviewed Aug 26, 2025

View reviewed changes

src/StackExchange.Redis/VectorSetSimilaritySearchMessage.cs Show resolved Hide resolved

NickCraver reviewed Aug 26, 2025

View reviewed changes

src/StackExchange.Redis/VectorSetSimilaritySearchMessage.cs Outdated Show resolved Hide resolved

NickCraver reviewed Aug 26, 2025

View reviewed changes

tests/StackExchange.Redis.Tests/VectorSetIntegrationTests.cs Outdated Show resolved Hide resolved

NickCraver reviewed Aug 26, 2025

View reviewed changes

tests/StackExchange.Redis.Tests/VectorSetIntegrationTests.cs Show resolved Hide resolved

fix PR nits

d53f1ab

mgravell requested a review from NickCraver September 4, 2025 16:03

change VSIM API to allow future extensivility:

2157db1

- make VectorSetSimilaritySearchRequest abstract - remove Member and Vector - add ByMember and ByVector factory methods - (internal changes to support the above, in the message etc)

remove redundant property

a53365b

NickCraver reviewed Sep 9, 2025

View reviewed changes

NickCraver approved these changes Sep 9, 2025

View reviewed changes

Propose vector-set API #2939

Are you sure you want to change the base?

Propose vector-set API #2939

Uh oh!

Conversation

mgravell commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mgravell commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mgravell commented Sep 4, 2025

Uh oh!

mgravell commented Sep 4, 2025

Uh oh!

badrishc commented Sep 4, 2025

Uh oh!

mgravell commented Sep 5, 2025

Uh oh!

kevin-montrose commented Sep 5, 2025

Uh oh!

mgravell commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kevin-montrose commented Sep 5, 2025

Uh oh!

mgravell commented Sep 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mgravell commented Sep 6, 2025

Uh oh!

kevin-montrose commented Sep 7, 2025

Uh oh!

mgravell commented Sep 7, 2025

Uh oh!

mgravell commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mgravell commented Sep 9, 2025

Uh oh!

kevin-montrose commented Sep 9, 2025

Uh oh!

NickCraver Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

NickCraver left a comment

Choose a reason for hiding this comment

Uh oh!

mgravell commented Sep 9, 2025

Uh oh!

Uh oh!

mgravell commented Aug 11, 2025 •

edited

Loading

mgravell commented Aug 14, 2025 •

edited

Loading

mgravell commented Sep 5, 2025 •

edited

Loading

mgravell commented Sep 6, 2025 •

edited

Loading

mgravell commented Sep 9, 2025 •

edited

Loading