PDFMate.AI is a Django-based app that lets you upload PDFs, indexes them into a vector database, and lets you ask natural-language questions to get grounded answers with evidence. It uses PyMuPDF for PDF parsing, Transformers + PyTorch for embeddings, and Pinecone for fast semantic search. Clean templates provide Q&A views with cited contexts.

saksham7685/PDFMate.Ai

PDFMate.Ai

Your personalised AI-powered PDF buddy!

A Django-based RAG app: upload PDFs, index them into Pinecone using Hugging Face embeddings, and answer questions with a local Ollama model (zephyr:7b-alpha, a distilled model).
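To make the "answer questions" step concrete, here is a minimal sketch of how retrieved passages might be combined with a question into a single prompt for the model. The function name `build_rag_prompt` and the prompt wording are hypothetical and are not taken from the repository; the actual template used by the app may differ.

```python
def build_rag_prompt(question: str, contexts: list[str]) -> str:
    """Combine retrieved passages and a question into one prompt string.

    Hypothetical helper: the wording below is an illustrative template,
    not the one used by PDFMate.Ai.
    """
    # Number each passage so the model can refer to its evidence.
    numbered = "\n\n".join(
        f"[{i}] {passage.strip()}" for i, passage in enumerate(contexts, start=1)
    )
    return (
        "Answer the question using only the context below. "
        "Cite the passages you used by number.\n\n"
        f"Context:\n{numbered}\n\n"
        f"Question: {question.strip()}\nAnswer:"
    )
```

Numbering the passages is one simple way to let the answer point back to its evidence, which fits the evidence contexts the app displays.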

Features

  • Upload PDF and auto-split into chunks
  • Embeddings via all-mpnet-base-v2
  • Pinecone v3 serverless vector store
  • RAG inference via local Ollama zephyr:7b-alpha (distilled model)
  • Evidence contexts
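The chunking step in the first feature can be sketched as a sliding character window with overlap. This is an assumption about the general technique, not the repository's actual splitter: the function name, `chunk_size`, and `overlap` values are all illustrative.

```python
def split_into_chunks(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows.

    Hypothetical helper: sizes are illustrative defaults, not values
    taken from PDFMate.Ai.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: list[str] = []
    for start in range(0, len(text), step):
        piece = text[start:start + chunk_size].strip()
        if piece:
            chunks.append(piece)
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks
```

The overlap keeps a sentence that straddles a boundary visible in both neighbouring chunks, at the cost of storing some text twice in the vector index.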

Setup

  1. Python 3.10+ and PyTorch.
  2. Install Ollama and pull the model:
    • ollama pull zephyr:7b-alpha
  3. Create and fill .env (see .env.example).
  4. Install dependencies:
    • pip install -r requirements.txt
  5. Run the migrations and start the Django server:
    • python manage.py migrate
    • python manage.py runserver
  6. Open http://127.0.0.1:8000
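After these steps, it can help to confirm that the model pulled in step 2 responds before asking questions through the app. A minimal sketch using only the standard library, assuming Ollama is listening on its default address `http://localhost:11434` and using its `/api/generate` endpoint; the function names are hypothetical and not part of the repository.

```python
import json
import urllib.request

# Assumption: Ollama's default local address; adjust to match OLLAMA_BASE_URL.
OLLAMA_BASE_URL = "http://localhost:11434"


def build_generate_request(
    prompt: str,
    model: str = "zephyr:7b-alpha",
    base_url: str = OLLAMA_BASE_URL,
) -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request for Ollama."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_ollama(prompt: str, timeout: float = 120.0) -> str:
    """Send the request and return the generated text (needs a running Ollama)."""
    request = build_generate_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

With Ollama running, calling `ask_ollama("Reply with the single word: ready")` should return a short completion; a connection error usually means the Ollama server is not started or is listening on a different address.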

Environment Variables

See .env.example for all settings:

  • PINECONE_API_KEY is required.
  • If the Hugging Face model is gated, set HUGGINGFACE_TOKEN.
  • Ensure Ollama is running locally at OLLAMA_BASE_URL.
