GraphRAG

Update TEI docker image to 1.6 (opea-project#1650 )

Mar 27, 2025

87baeb8 · Mar 27, 2025

This branch is 1 commit behind opea-project/GenAIExamples:main.

Name	Name	Last commit message	Last commit date
parent directory ..
assets	assets	Graph rag (opea-project#1007 )	Oct 30, 2024
docker_compose	docker_compose	Update TEI docker image to 1.6 (opea-project#1650 )	Mar 27, 2025
docker_image_build	docker_image_build	Adapt code for dataprep microservice refactor (opea-project#1408 )	Jan 20, 2025
tests	tests	Update TEI docker image to 1.6 (opea-project#1650 )	Mar 27, 2025
ui	ui	Fix Domain Access Issue in Latest Vite Version (opea-project#1444 )	Jan 21, 2025
Dockerfile	Dockerfile	Use GenAIComp base image to simplify Dockerfiles & reduce image sizes…	Mar 13, 2025
README.md	README.md	bugfix GraphRAG updated docker compose and env settings to fix issues…	Mar 4, 2025
graphrag.py	graphrag.py	Fix "OpenAI" & "response" spelling (opea-project#1561 )	Feb 25, 2025

README.md

GraphRAG Application

While naive RAG works well in fetching precise information it fails on global questions directed at an entire text corpus, such as "What are the main themes in the dataset?". GraphRAG was introduced by Microsoft paper "From Local to Global: A Graph RAG Approach to Query-Focused Summarization". The key elements are:

Uses LLM to derive an entity knowledge graph from the source documents
Uses hierarchical leiden algorithm to identify communities of closely-related entities and summaries are extracted for each community
For an input query the relevant communities are identified and partial answers are generated from each of the community summaries (query-focused summarization (QFS))
There is a final generation stage that responds to the query based on the intermediate community answers.

Deploy GraphRAG Service

The GraphRAG service can be effortlessly deployed on Intel Gaudi2, Intel Xeon Scalable Processors.

Quick Start Deployment Steps:

Set up the environment variables.
Run Docker Compose.
Consume the GraphRAG Service.

Note: If you do not have docker installed you can run this script to install docker : bash docker_compose/install_docker.sh

Quick Start: 1.Setup Environment Variable

To set up environment variables for deploying GraphRAG services, follow these steps:

Set the required private environment variables:

export host_ip=${your_hostname IP} #local IP, i.e "192.168.1.1"
export NEO4J_URI=${your_neo4j_url}
export NEO4J_USERNAME=${your_neo4j_username}
export NEO4J_PASSWORD=${your_neo4j_password}
export PYTHONPATH=${path_to_comps}
export OPENAI_KEY=${your_openai_api_key} #optional, when not provided will use smaller models TGI/TEI
export HUGGINGFACEHUB_API_TOKEN=${your_hf_token} #needed for TGI/TEI models

If you are in a proxy environment, also set the proxy-related environment variables:

export http_proxy="Your_HTTP_Proxy"
export https_proxy="Your_HTTPs_Proxy"
export no_proxy=$no_proxy,${host_ip} #important to add {host_ip} for containers communication

Set up other environment variables:

# on Gaudi
source ./docker_compose/intel/hpu/gaudi/set_env.sh

Quick Start: 2.Run Docker Compose

If the microservice images are available in Docker Hub they will be pulled, otherwise you will need to build the container images manually. Please refer to the 'Build Docker Images' in Guide. test_compose_on_gaudi.sh can be a good resource as it shows how to do image build, starting services, validated each microservices and megaservices. This is what is used in CI/CD.

Docker compose will start 8 services:

cd GraphRAG/docker_compose/intel/hpu/gaudi
docker compose -f compose.yaml up -d

QuickStart: 3.Upload RAG Files and Consume the GraphRAG Service

To chat with retrieved information, you need to upload a file using Dataprep service.

Here is an example of Nike 2023 pdf.

# download pdf file
wget https://raw.githubusercontent.com/opea-project/GenAIComps/v1.1/comps/retrievers/redis/data/nke-10k-2023.pdf
# upload pdf file with dataprep
curl -X POST "http://${host_ip}:11103/v1/dataprep/ingest" \
    -H "Content-Type: multipart/form-data" \
    -F "files=@./nke-10k-2023.pdf"

curl http://${host_ip}:8888/v1/graphrag \
    -H "Content-Type: application/json"  \
    -d '{"messages": [{"role": "user","content": "where do Nike subsidiaries operate?
    "}]}'

Architecture and Deploy details

The GraphRAG example is implemented using the component-level microservices defined in GenAIComps. The flow chart below shows the information flow between different microservices for this example.

---
config:
  flowchart:
    nodeSpacing: 400
    rankSpacing: 100
    curve: linear
  themeVariables:
    fontSize: 50px
---
flowchart LR
    %% Colors %%
    classDef blue fill:#ADD8E6,stroke:#ADD8E6,stroke-width:2px,fill-opacity:0.5
    classDef orange fill:#FBAA60,stroke:#ADD8E6,stroke-width:2px,fill-opacity:0.5
    classDef orchid fill:#C26DBC,stroke:#ADD8E6,stroke-width:2px,fill-opacity:0.5
    classDef invisible fill:transparent,stroke:transparent;
    style GraphRAG-MegaService stroke:#000000

    %% Subgraphs %%
    subgraph GraphRAG-MegaService["GraphRAG MegaService "]
        direction LR
        RET([Retrieval MicroService]):::blue
        LLM([LLM MicroService]):::blue
        EM([Embedding MicroService]):::blue
    end
    subgraph UserInterface[" User Interface "]
        direction LR
        a([User Input Query]):::orchid
        Ingest([Ingest data]):::orchid
        UI([UI server<br>]):::orchid
    end


    GDB{{Graph DB<br><br>}}
    DP([Data Preparation MicroService]):::blue
    GW([GraphRAG GateWay<br>]):::orange


    %% Data Preparation flow
    %% Ingest data flow
    direction LR
    Ingest[Ingest data] --> UI
    UI --> DP

    %% interactions buried inside the DP and RET microservice implementations
    DP <-.-> EM
    DP <-.-> LLM
    RET <-.-> EM
    RET <-.-> LLM


    %% Questions interaction
    direction LR
    a[User Input Query] --> UI
    UI --> GW
    GW <==> GraphRAG-MegaService
    RET ==> LLM


    direction TB
    %% Graph DB interaction
    RET <-.-> |d|GDB
    DP <-.-> |d|GDB

    linkStyle 2 stroke:#000000,stroke-width:2px;
    linkStyle 3 stroke:#000000,stroke-width:2px;
    linkStyle 4 stroke:#000000,stroke-width:2px;
    linkStyle 5 stroke:#000000,stroke-width:2px;

Note: The Dataprep and Retriever microservices use the LLM Microservice and Embedding Microservice in their implementation. For example, Dataprep uses LLM to extract entities and relationships from text to build graph and Retriever uses LLM to summarize communities (these are clusters of similar entities and their properties). Those endpoint interactions with the corresponding prompt templates are buried in the microservice implementation thus not managed by the megaservice orchestrator scheduler and not exposed in the megaservice. Shown as thin black lines in diagram.

This GraphRAG use case performs RAG using Llama-index, Neo4J Graph Property Store and Text Generation Inference on Intel Gaudi2 or Intel Xeon Scalable Processors. In the below, we provide a table that describes for each microservice component in the GraphRAG architecture, the default configuration of the open source project, hardware, port, and endpoint.

Gaudi default compose.yaml

MicroService	Open Source Project	HW	Port	Endpoint
Embedding	Llama-index	Xeon	6006	/v1/embaddings
Retriever	Llama-index, Neo4j	Xeon	6009	/v1/retrieval
LLM	Llama-index, TGI	Gaudi	6005	/v1/chat/completions
Dataprep	Neo4j, LlamaIndex	Xeon	6004	/v1/dataprep/ingest

Models Selection

GraphRAG quality dependents heavily on the ability to extract a high quality graph. We highly recommend using the best model available to you. Table below shows default models specified in the codebase when OPENAI_API_KEY is available and for local inference w TEI/TGI. The local models are small since those will be used in CI/CD but users should improve upon these by changing the xxx_MODEL_ID in docker_compose/xxx/set_env.sh.

Working on a table comparison of various model sizes vs. naive RAG with a dataset that reflects well the benefits of GraphRAG. Stay tuned!

Service	Model
Embedding	BAAI/bge-base-en-v1.5
Embedding	"text-embedding-3-small"
LLM	gpt-4o
LLM	"meta-llama/Meta-Llama-3-8B-Instruct"

Consume GraphRAG Service with RAG

Check Service Status

Before consuming GraphRAG Service, make sure each microservice is ready by checking the docker logs of each microservice. test_compose_on_gaudi.sh can be a good resource as it shows how CI/CD validated each microservices based on returned HTTP status and response body.

docker logs container_name

Upload RAG Files

To chat with retrieved information, you need to upload a file using Dataprep service.

Here is an example of Nike 2023 pdf.

# download pdf file
wget https://raw.githubusercontent.com/opea-project/GenAIComps/v1.1/comps/retrievers/redis/data/nke-10k-2023.pdf
# upload pdf file with dataprep
curl -X POST "http://${host_ip}:6007/v1/dataprep/ingest" \
    -H "Content-Type: multipart/form-data" \
    -F "files=@./nke-10k-2023.pdf"

Consume GraphRAG Service

Two ways of consuming GraphRAG Service:

Use cURL command on terminal

curl http://${host_ip}:8888/v1/graphrag \
    -H "Content-Type: application/json"  \
    -d '{
        "model": "gpt-4o-mini","messages": [{"role": "user","content": "Who is John Brady and has he had any confrontations?
    "}]}'

Access via frontend

To access the frontend, open the following URL in your browser: http://{host_ip}:5173

By default, the UI runs on port 5173 internally.

If you choose conversational UI, use this URL: http://{host_ip}:5174

Troubleshooting

If you get errors like "Access Denied", validate micro service first. A simple example:

http_proxy="" curl ${host_ip}:6006/embed -X POST  -d '{"inputs":"What is Deep Learning?"}' -H 'Content-Type: application/json'

(Docker only) If all microservices work well, check the port ${host_ip}:8888, the port may be allocated by other users, you can modify the compose.yaml.
(Docker only) If you get errors like "The container name is in use", change container name in compose.yaml.

Monitoring OPEA Service with Prometheus and Grafana dashboard

OPEA microservice deployment can easily be monitored through Grafana dashboards in conjunction with Prometheus data collection. Follow the README to setup Prometheus and Grafana servers and import dashboards to monitor the OPEA service.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

GraphRAG

GraphRAG

README.md

GraphRAG Application

Deploy GraphRAG Service

Quick Start: 1.Setup Environment Variable

Quick Start: 2.Run Docker Compose

QuickStart: 3.Upload RAG Files and Consume the GraphRAG Service

Architecture and Deploy details

Models Selection

Consume GraphRAG Service with RAG

Check Service Status

Upload RAG Files

Consume GraphRAG Service

Troubleshooting

Monitoring OPEA Service with Prometheus and Grafana dashboard

Files

GraphRAG

Directory actions

More options

Directory actions

More options

Latest commit

History

GraphRAG

Folders and files

parent directory

README.md

GraphRAG Application

Deploy GraphRAG Service

Quick Start: 1.Setup Environment Variable

Quick Start: 2.Run Docker Compose

QuickStart: 3.Upload RAG Files and Consume the GraphRAG Service

Architecture and Deploy details

Models Selection

Consume GraphRAG Service with RAG

Check Service Status

Upload RAG Files

Consume GraphRAG Service

Troubleshooting

Monitoring OPEA Service with Prometheus and Grafana dashboard