This project demonstrates how to set up a complete Retrieval-Augmented Generation (RAG) pipeline on medical data using the Llama-3-8B model.

Siddhesh19991/RAG_Medical_Data

RAG_Medical_Data

This repository demonstrates the implementation of a RAG pipeline using Llama-3-8B. It is part of a comparative study between fine-tuning and Retrieval-Augmented Generation (RAG) to determine which approach is more suitable for our use case.

The detailed blog can be found here.

RAG_medical.ipynb contains all the code necessary for setting up the RAG pipeline.

Dataset

For this project, we will be using publicly available medical data. This dataset is structured as prompt-completion pairs, where users ask medical questions and receive relevant responses from doctors. (Data Source)

Overview of the pipeline: [Figure: pipeline overview screenshot]
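
To make the retrieval step of a RAG pipeline concrete, here is a minimal sketch of how prompt-completion pairs like those in the dataset could be searched and packed into a prompt. It is not the code from RAG_medical.ipynb: the sample records, the function names (`retrieve`, `build_prompt`), and the bag-of-words cosine similarity are all assumptions made for illustration. A real pipeline would more likely use a learned embedding model and a vector store, and would pass the final prompt to Llama-3-8B.

```python
import math
import re
from collections import Counter

# Hypothetical stand-ins for the dataset's prompt-completion pairs.
PAIRS = [
    {"prompt": "What can I take for a persistent dry cough?",
     "completion": "A dry cough lasting over three weeks should be examined by a doctor."},
    {"prompt": "How do I lower high blood pressure?",
     "completion": "Reduce salt intake, exercise regularly, and follow up with your doctor."},
    {"prompt": "Is a mild fever after vaccination normal?",
     "completion": "A mild fever for a day or two after vaccination is common."},
]


def vectorize(text):
    """Turn text into a bag-of-words term-count vector (toy substitute for embeddings)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, pairs, k=2):
    """Return the k pairs most similar to the question."""
    query = vectorize(question)
    ranked = sorted(
        pairs,
        key=lambda p: cosine(query, vectorize(p["prompt"] + " " + p["completion"])),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, retrieved):
    """Combine the retrieved pairs and the user question into one prompt string."""
    context = "\n\n".join(
        f"Patient: {p['prompt']}\nDoctor: {p['completion']}" for p in retrieved
    )
    return (
        "Use the context below to answer the question.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


if __name__ == "__main__":
    question = "My blood pressure is high, what should I do?"
    print(build_prompt(question, retrieve(question, PAIRS, k=1)))
```

The string returned by `build_prompt` is what would be sent to the language model. Swapping `vectorize` and `cosine` for an embedding model and a nearest-neighbour index changes the retrieval quality but not the overall flow.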

For questions or feedback about the project, don't hesitate to reach out to me on LinkedIn.

The fine-tuning implementation for this study can be found here.
