Releases · illuin-tech/vidore-benchmark

19 Feb 10:02

tonywu71

v5.0.0

d167f9c

Latest

Description

This release is an extensive package restructure, making vidore-benchmark faster and easier to use. Among the main new features:

⚡ Faster eval with optimized data loading
💽 Support for BEIR format
🖥️ A cleaner, more intuitive CLI
🐍 Evaluate vision retrieval straight from the Python API

🚨 This releases introduces many breaking changes, thus it is NOT backward-compatible with previous versions.

Features

Added

Add CLI eval support for ColQwen2, DSEQwen2, Cohere, ColIdefics3 API embedding models
Add Pydantic models for storing the ViDoRe benchmark results and metadata (includes vidore-benchmark version)
Add option to create an EvalManager instance from ViDoReBenchmarkResults
Add num_workers argument when using dataloaders
Allow the creation of a VisionRetriever instance using a PyTorch model and a processor that implements a process_images and a process_queries methods, similarly to the ColVision processors
Add dataloader_prebatch_query and dataloader_prebatch_passage arguments to avoid loading the entire datasets in memory (used to cause RAM spikes when loading large image datasets)
Add QA-to-BEIR dataset format conversion script
Add support for the BEIR dataset format with ViDoReEvaluatorBEIR

Changed

[Breaking] Change the CLI argument names
Add option to load a specific checkpoint for Hf models with pretrained_model_name_or_path
Improve soft dependency handling in retriever classes (the colpali-engine is now optional)
[Breaking] Change the get_scores signature
[Breaking] Rename forward_documents to forward_passages to match the literature and reduce confusion
[Breaking] Rename DSERetriever into DSEQwen2Retriever
[Breaking] Rename args in the CLI script
When available, use processor.get_scores instead of custom scoring snippet
[Breaking] Rename ColQwenRetriever to ColQwen2Retriever
[Breaking] Rename BiQwenRetriever to BiQwen2Retriever
[Breaking] Revamp the evaluate module. Evaluation is now handled by the ViDoReEvaluatorQA class
[Breaking] Rename ViDoReEvaluator into BaseViDoReEvaluator. The new ViDoReEvaluator class allows to create retrievers using the Python API.
Set default num_workers to 0 in retrievers
Update default checkpoints for ColPali and ColQwen2 retrievers

Fixed

Fix evaluate_dataset when used with the BM25 retriever
Fix issue when no pretrained_model_name_or_path = None in load_vision_retriever_from_registry
Fix DummyRetriever's get_scores method
Fix processor output not being sent to the correct device in ColQwen2Retriever
Fix bugs in BiQwen2Retriever
Fix try-catch block for soft dep check in SigLIPRetriever

Removed

Remove experimental quantization module
Remove the interpretability module. The interpretability code has been moved and improved as part of the colpali-engine==0.3.2 release.
[Breaking] Remove support for token pooling. This feature will be re-introduced in colpali-engine>0.3.9
Replace loguru with built-in logging module
Remove the retrieve_on_dataset and retrieve_on_pdfs entrypoint scripts
Remove the pdf_utils module
Remove the get_top_k method from the evaluate module
Remove the plot_utils and test_utils modules
Remove the experiments directory

Tests

Add tests for all built-in vision retrievers
Add fixtures in retriever tests to speed up testing
Add tests for ViDoReBenchmarkResults
Add tests for EvalManager
Add tests and E2E tests for cli command evaluate-retriever
Add tests for ViDoReEvaluatorBEIR

Full Changelog: v4.0.2...v5.0.0

Assets 2

17 Oct 09:27

tonywu71

v4.0.2

5e062df

v4.0.2

Features

Deprecated

Deprecate the interpretability module

Build

Fix and update conflicts for package dependencies

Assets 2

07 Oct 21:48

tonywu71

v4.0.1

c217f52

v4.0.1

Features

Changed

Rename ColPali model alias to match model name (use --model-name vidore/colpali instead of --model-name vidore/colpali-v1.2 with the vidore-benchmark evaluate-retriever CLI)
Use the ColPali model name to load ColPaliProcessor instead of the PaliGemma one

Fixed

Add missing model.eval() to all vision retrievers to make results deterministic

Full Changelog: v4.0.0...v4.0.1

Assets 2

21 Sep 10:23

tonywu71

v4.0.0

56bf016

v4.0.0

Description

🚨 This release is breaking as it leverages the restructured colpali-engine package. In particular, the duplicate code logic was removed. Thus, it is NOT backward-compatible with previous versions.

Features

Added

Add "Ruff" and "Test" CI pipelines
Add upper bound for colpali-engine to prevent eventual breaking changes (Follow-up of #30)

Changed

Remove unused deps from pyproject
Clean pyproject
Bump colpali-engine to v0.3.0 and adapt code for the new API
Replace black with ruff linter
Add better ColPali model loading

Removed

Remove duplicate code with colpali-engine (e.g. remove ColPaliProcessor, ColPaliScorer...)

Fixed

Change typing to support Python 3.9
Fix the generate_similarity_maps CLI
Various fixes

Full Changelog: v3.4.2...v4.0.0

Assets 2

09 Sep 13:00

tonywu71

v3.4.2

563c06b

v3.4.2

Features

Fixed

Fix typo when making model_name configurable in previous release
Fix wrong image processing for ColPali

Full Changelog: v3.4.1...v3.4.2

Assets 2

09 Sep 08:14

tonywu71

v3.4.1

94cb2f2

v3.4.1

Features

Changed

Make model_name configurable in ColPaliRetriever as optional arg
Tweak EvalManager
Tweak ColPaliProcessor
Improve tests

Full Changelog: v3.4.0...v3.4.1

Assets 2

29 Aug 13:21

tonywu71

v3.4.0

0356bb0

v3.4.0

Features

Changed

Add the colpali-engine dependency
Add sentence-transformers to compulsory dependencies
Bump ColPali to v1.2 [model card]
Remove the baselines dependency group

Full Changelog: v3.3.0...v3.4.0

Assets 2

22 Aug 09:25

tonywu71

v3.3.0

931b08c

v3.3.0

Features

Added

Add int8 quantization for embeddings (experimental)
Add tests for int8 quantization
Add dynamic versioning with hatch-vcs
Add loggers in modules (with loguru)

Changed

Re-organize quantization modules

Fixed

Fix dtype conversion in ColPaliScorer
Fix int8 overflow when computing ColBERT score w/ int8 embeddings
Fix quantization tests

Removed

Remove unused ColPaliWithPoolingRetriever

Full Changelog: v3.2.0...v3.3.0

Assets 2

15 Aug 07:52

tonywu71

v3.2.0

5d158e1

v3.2.0

Features

Added

Add experiments on token pooling
Add from_multiple_json method in EvalManager

Fixed

Fix error when using token pooling with bfloat16 tensors
Add missing L2 normalization in token pooling

Full Changelog: v3.1.0...v3.2.0

Assets 2

06 Aug 09:10

tonywu71

v3.1.0

e502e61

v3.1.0

Features

Added

Add support for token pooling in document embeddings

Fixed

Fix ruff settings

Full Changelog: v3.0.0...v3.1.0

Assets 2

Releases: illuin-tech/vidore-benchmark

v5.0.0: Package restructure

Description

Features

Added

Changed

Fixed

Removed

Tests

Uh oh!

v4.0.2

Features

Deprecated

Build

Uh oh!

v4.0.1

Features

Changed

Fixed

Uh oh!

v4.0.0

Description

Features

Added

Changed

Removed

Fixed

Uh oh!

v3.4.2

Features

Fixed

Uh oh!

v3.4.1

Features

Changed

Uh oh!

v3.4.0

Features

Changed

Uh oh!

v3.3.0

Features

Added

Changed

Fixed

Removed

Uh oh!

v3.2.0

Features

Added

Fixed

Uh oh!

v3.1.0

Features

Added

Fixed

Uh oh!