This project is a Streamlit web application that implements a Retrieval-Augmented Generation (RAG) pipeline using LlamaIndex. It lets users ask questions about a collection of documents and receive answers with citations pointing to the specific sources within those documents.
The application uses ChromaDB as a vector store to efficiently retrieve relevant context from the documents in the `data` directory.
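Conceptually, the vector store's job can be illustrated with a toy, dependency-free sketch: embed each document chunk, then rank chunks by similarity to the query. Real retrieval uses learned embeddings from a model, not the word counts used here — this is only a minimal illustration of the idea.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy 'embedding': lowercase word counts (a real app uses an embedding model)."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, chunks: list[str], top_k: int = 1) -> list[str]:
    """Return the top_k chunks most similar to the query."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:top_k]


chunks = [
    "ChromaDB is an open source vector database.",
    "Streamlit builds interactive web apps in Python.",
    "LlamaIndex orchestrates retrieval augmented generation pipelines.",
]
print(retrieve("which vector database is used", chunks))
```

ChromaDB does the same ranking at scale, over embedding vectors it persists on disk, which is why retrieval stays fast as the document collection grows.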
Below are screenshots of an ongoing conversation, showing the LLM's response with source citations:
- Interactive chat interface powered by Streamlit.
- Query documents using natural language.
- Receive answers generated by a Large Language Model (LLM).
- Answers include citations to the source documents for verification.
- Uses LlamaIndex for the RAG pipeline.
- Uses ChromaDB for vector storage.
- Clone the repository:

  ```bash
  git clone <repository-url>
  cd streamlit-llamaindex-rag-citation
  ```
- Create and activate a Python virtual environment:

  ```bash
  python -m venv citationenv
  source citationenv/bin/activate
  # On Windows, use:
  # citationenv\Scripts\activate
  ```
- Install the required dependencies:

  ```bash
  pip install -r requirements.txt
  ```
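For reference, the stack described above typically maps to PyPI packages like the following — the repo's own `requirements.txt` is authoritative, and the `llama-index-vector-stores-chroma` integration package assumes a recent (post-0.10) LlamaIndex release:

```text
streamlit
llama-index
llama-index-vector-stores-chroma
chromadb
```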
- Set up your API keys: create a file named `secrets.toml` inside a `.streamlit` directory:

  ```
  .streamlit/
  └── secrets.toml
  ```

- Add your OpenAI API key to `secrets.toml`:

  ```toml
  OPENAI_API_KEY = "sk-..."
  ```
- Add your data: place the PDF documents you want to query in the `data` directory. The project comes with some example documents.

- Run the Streamlit application:

  ```bash
  streamlit run citation_app.py
  ```
- Open your browser: navigate to the local URL provided by Streamlit (usually `http://localhost:8501`).
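How the app renders citations depends on `citation_app.py`, but the general pattern is that a citation-aware query engine (such as LlamaIndex's `CitationQueryEngine`) returns an answer containing numbered markers alongside the source chunks they refer to. A hypothetical sketch of mapping markers back to sources — the marker format and helper name here are illustrative assumptions, not the app's actual code:

```python
import re

# Hypothetical illustration: given an LLM answer containing numbered
# citation markers like "[1]", return only the sources it actually cites.
def cited_sources(answer: str, sources: dict[int, str]) -> dict[int, str]:
    """Map citation markers in the answer back to their source descriptions."""
    cited = {int(n) for n in re.findall(r"\[(\d+)\]", answer)}
    return {n: sources[n] for n in sorted(cited) if n in sources}


sources = {1: "report.pdf, page 3", 2: "manual.pdf, page 12", 3: "notes.pdf, page 1"}
answer = "Revenue grew 12% [1], driven by new markets [3]."
print(cited_sources(answer, sources))
# → {1: 'report.pdf, page 3', 3: 'notes.pdf, page 1'}
```

Displaying only the cited subset, as above, lets users verify each claim against the exact document and page it came from.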
```
.
├── .gitignore
├── citation_app.py      # Main Streamlit application
├── readme.md            # This file
├── requirements.txt     # Python dependencies
├── .streamlit/
│   └── secrets.toml     # Secrets management for Streamlit
├── chroma_db/           # ChromaDB vector store
├── citation/            # LlamaIndex storage
├── citationenv/         # Python virtual environment
└── data/                # Source documents (PDFs)
```

