RAG_Medical_Data

This repository demonstrates the implementation of an RAG pipeline using Llama-3-8B. It is part of a comparative study between fine-tuning and Retrieval-Augmented Generation (RAG) to determine which approach is more suitable for our use case.

The detailed blog can be found here.

RAG_medical.ipynb contains all the code necessary for setting up the RAG pipeline

Dataset

For this project, we will be using publicly available medical data. This dataset is structured as prompt-completion pairs, where users ask medical questions and receive relevant responses from doctors. (Data Source)

Overview of the pipeline:

For questions or feedback about the project, don't hesitate to reach out to me on LinkedIn.

The fine-tuning implementation for this study can be found here.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
Medical_data.json		Medical_data.json
RAG_Medical.ipynb		RAG_Medical.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG_Medical_Data

Dataset

About

Languages

Siddhesh19991/RAG_Medical_Data

Folders and files

Latest commit

History

Repository files navigation

RAG_Medical_Data

Dataset

About

Topics

Resources

Stars

Watchers

Forks

Languages