Multimodal Document Chatbot

Inspired by https://archive.is/2025.04.13-061540/https://medium.com/data-science-collective/rag-in-action-build-your-own-local-pdf-chatbot-as-a-beginner-96c2833869ff

Multimodal Document Chatbot

A powerful chatbot that can answer questions about your documents - including PDFs, Word documents, text files, and images with text.

Features

Multiple Document Types: Process PDFs, Word documents (.docx), text files, and images (.jpg, .png, etc.)
Built-in OCR: Extract text from images and scanned documents using EasyOCR
Semantic Search: Find relevant information across all your documents
Claude AI Integration: Get intelligent, human-like responses to your questions
User-Friendly Interface: Easy-to-use web interface built with Streamlit

Requirements

Python 3.8+
Claude API key (from Anthropic)

Installation

Clone this repository or download the files
Create a virtual environment and activate it:

python -m venv venv

# On Windows:
venv\Scripts\activate

# On macOS/Linux:
source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Create a .env file in the project directory with your Claude API key:

ANTHROPIC_API_KEY=your_api_key_here

Usage

1. Prepare your documents

Place your document files in a folder called documents in the project directory. The system supports:

PDF files (.pdf)
Word documents (.docx, .doc)
Text files (.txt)
Image files with text (.jpg, .jpeg, .png, .bmp, .tiff, .tif)

2. Process the documents

Run the following command to process your documents and create the vector database:

python process_documents.py

You can specify custom folders if needed:

python process_documents.py --docs_folder custom_docs --db_folder custom_db

3. Run Streamlit to access the user interface

Then start the app:

streamlit run app.py

4. Open your browser

Go http://localhost:8501 if it doesn't open automatically.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
chatbot.py		chatbot.py
document_processor.py		document_processor.py
process_documents.py		process_documents.py
requirements.txt		requirements.txt
vector_db.py		vector_db.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Multimodal Document Chatbot

Features

Requirements

Installation

Usage

1. Prepare your documents

2. Process the documents

3. Run Streamlit to access the user interface

4. Open your browser

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

sw-sys/document_chatbot

Folders and files

Latest commit

History

Repository files navigation

Multimodal Document Chatbot

Features

Requirements

Installation

Usage

1. Prepare your documents

2. Process the documents

3. Run Streamlit to access the user interface

4. Open your browser

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages