A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.
-
Updated
May 9, 2025 - Python
A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.
Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.
Experimenting with different kinds of RAGs Systems
The course provides a comprehensive guide to optimizing retrieval systems in large-scale RAG applications. It covers tokenization, vector quantization, and search optimization techniques to enhance search quality, reduce memory usage, and balance performance in vector search systems.
This project processes and retrieves information from PDF file or PDF collection. It leverages Qdrant as a vector database for similarity searches and employs a Retrieval-Augmented Generation (RAG).
Implements a Retrieval-Augmented Generation (RAG) system.
Training Data Generator for SPLADE Model Fine-tuning
Add a description, image, and links to the rag-systems topic page so that developers can more easily learn about it.
To associate your repository with the rag-systems topic, visit your repo's landing page and select "manage topics."