-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is Euclidean distance not supported? #33
Comments
Thanks for your interesting of our project. It is possible to support euclidean distance. Please take a look of the "score" package: There are two suite of APIs in the implementation of a score function, one is for dense vector set, the other is for sparse vector set. And the rest of the API are all event hooks. If you could implement this feature, it is highly plausible. Or we can take this but will be due in late next week. Thanks again. |
Supporting Manhattan distance also seems very good, thanks! but, I couldn't fully understand integer vector score function because I didn't fully read and understand simbase code ==>
I'm just in the phase of planning new service, I cannot test simbase code right now... |
Hi, @bwlim , The integer vector API is for the sparse vectors. Sparsity is very common in high dimensional data, in this scenario, dense storage format is very ineffective, so we introduce sparse storage format. For example, we have a 1024 dimensional base, the two format are as below
|
I'm very happy to see open source Vector Database!
Simbase is great for me, thanks :D
I have a question (or maybe new feature request..)
Supported similarity(score) functions are "cosinesq" and "jensenshannon"
cosine similarity function does not count vector magnitude..
But in my application, vector magnitude is meaningful for similar vector search.
I want similarity function using "Euclidean distance" to be supported also :D
Give some guides, thanks for your great vector DB :D
The text was updated successfully, but these errors were encountered: