Skip to content

Conversation

@sducouedic
Copy link

When indexing big document corpus, the embedding runs slower and slower. The reason is at each iteration, all the embedding vectors are stored, instead of only the newly created ones. The changes in this PR allow to save only the necessary embedding files (new files and file to be modified) instead of all the files.

Signed-off-by: Sophie du Couédic <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant