Advanced Features

Multimodal Document Processing (RAG-Anything Integration)

LightRAG integrates with RAG-Anything, an All-in-One Multimodal Document Processing RAG system that enables advanced parsing and RAG capabilities across diverse document formats including PDFs, images, Office documents, tables, and formulas.

Key Features:

End-to-End Multimodal Pipeline: complete workflow from document ingestion to multimodal query answering
Universal Document Support: PDFs, Office documents (DOC/DOCX/PPT/PPTX/XLS/XLSX), images, and diverse file formats
Specialized Content Analysis: dedicated processors for images, tables, mathematical equations
Multimodal Knowledge Graph: automatic entity extraction and cross-modal relationship discovery
Hybrid Intelligent Retrieval: advanced search spanning textual and multimodal content

Quick Start

Install Rag-Anything

pip install raganything

RAGAnything Usage Example

import asyncio
from raganything import RAGAnything
from lightrag import LightRAG
from lightrag.llm.openai import openai_complete_if_cache, openai_embed
from lightrag.utils import EmbeddingFunc
import os

async def load_existing_lightrag():
    lightrag_working_dir = "./existing_lightrag_storage"

    from functools import partial

    lightrag_instance = LightRAG(
        working_dir=lightrag_working_dir,
        llm_model_func=lambda prompt, system_prompt=None, history_messages=[], **kwargs: openai_complete_if_cache(
            "gpt-4o-mini",
            prompt,
            system_prompt=system_prompt,
            history_messages=history_messages,
            api_key="your-api-key",
            **kwargs,
        ),
        embedding_func=EmbeddingFunc(
            embedding_dim=3072,
            max_token_size=8192,
            model="text-embedding-3-large",
            func=partial(
                openai_embed.func,
                model="text-embedding-3-large",
                api_key=api_key,
                base_url=base_url,
            ),
        )
    )

    await lightrag_instance.initialize_storages()

    rag = RAGAnything(
        lightrag=lightrag_instance,
        vision_model_func=lambda prompt, system_prompt=None, history_messages=[], image_data=None, **kwargs: openai_complete_if_cache(
            "gpt-4o",
            "",
            system_prompt=None,
            history_messages=[],
            messages=[
                {"role": "system", "content": system_prompt} if system_prompt else None,
                {"role": "user", "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{image_data}"}}
                ]} if image_data else {"role": "user", "content": prompt}
            ],
            api_key="your-api-key",
            **kwargs,
        ) if image_data else openai_complete_if_cache(
            "gpt-4o-mini",
            prompt,
            system_prompt=system_prompt,
            history_messages=history_messages,
            api_key="your-api-key",
            **kwargs,
        )
    )

    result = await rag.query_with_multimodal(
        "What data has been processed in this LightRAG instance?",
        mode="hybrid"
    )
    print("Query result:", result)

    await rag.process_document_complete(
        file_path="path/to/new/multimodal_document.pdf",
        output_dir="./output"
    )

if __name__ == "__main__":
    asyncio.run(load_existing_lightrag())

For detailed documentation and advanced usage, see the RAG-Anything repository.

Token Usage Tracking

Overview and Usage

LightRAG provides a TokenTracker tool to monitor and manage token consumption by large language models. This feature is useful for controlling API costs and optimizing performance.

from lightrag.utils import TokenTracker

token_tracker = TokenTracker()

# Method 1: Using context manager (Recommended)
with token_tracker:
    result1 = await llm_model_func("your question 1")
    result2 = await llm_model_func("your question 2")

# Method 2: Manually adding token usage records
token_tracker.reset()

rag.insert()

rag.query("your question 1", param=QueryParam(mode="naive"))
rag.query("your question 2", param=QueryParam(mode="mix"))

print("Token usage:", token_tracker.get_usage())

Usage Tips:

Use context managers for long sessions or batch operations to automatically track all token consumption
For segmented statistics, use manual mode and call reset() when appropriate
Regular checking of token usage helps detect abnormal consumption early

Example files:

examples/lightrag_gemini_track_token_demo.py: Token tracking with Google Gemini
examples/lightrag_siliconcloud_track_token_demo.py: Token tracking with SiliconCloud

Data Export Functions

LightRAG allows you to export your knowledge graph data in various formats for analysis, sharing, and backup.

Basic Usage

# Basic CSV export (default format)
rag.export_data("knowledge_graph.csv")

# Specify any format
rag.export_data("output.xlsx", file_format="excel")

Supported File Formats

rag.export_data("graph_data.csv", file_format="csv")
rag.export_data("graph_data.xlsx", file_format="excel")
rag.export_data("graph_data.md", file_format="md")
rag.export_data("graph_data.txt", file_format="txt")

Additional Options

Include vector embeddings in the export (optional):

rag.export_data("complete_data.csv", include_vector_data=True)

All exports include entity information (names, IDs, metadata), relation data (connections between entities), and relationship information from the vector database.

Cache Management

Clear Cache

aclear_cache() clears all cached entries in llm_response_cache. It does not support selective cleanup by mode or cache type.

# Asynchronous
await rag.aclear_cache()

# Synchronous
rag.clear_cache()

For selective cleanup of query-related caches, use the lightrag.tools.clean_llm_query_cache tool and see the guide in lightrag/tools/README_CLEAN_LLM_QUERY_CACHE.md. It manages query caches and keywords caches for mix, hybrid, local, and global modes. It does not clean extraction caches such as default:extract:* and default:summary:*.

Langfuse Observability Integration

Langfuse provides a drop-in replacement for the OpenAI client that automatically tracks all LLM interactions, enabling developers to monitor, debug, and optimize their RAG systems.

Installation

pip install lightrag-hku[observability]
# Or from source:
pip install -e ".[observability]"

Configuration

Add to .env file:

## Langfuse Observability (Optional)
LANGFUSE_SECRET_KEY=""
LANGFUSE_PUBLIC_KEY=""
LANGFUSE_HOST="https://cloud.langfuse.com"  # or your self-hosted instance
LANGFUSE_ENABLE_TRACE=true

Features

Once installed and configured, Langfuse automatically traces all OpenAI LLM calls. Dashboard features include:

Tracing: View complete LLM call chains
Analytics: Token usage, latency, cost metrics
Debugging: Inspect prompts and responses
Evaluation: Compare model outputs
Monitoring: Real-time alerting

Note: LightRAG currently only integrates OpenAI-compatible API calls with Langfuse. APIs such as Ollama, Azure, and AWS Bedrock are not yet supported for Langfuse observability.

RAGAS-based Evaluation

RAGAS (Retrieval Augmented Generation Assessment) is a framework for reference-free evaluation of RAG systems using LLMs. LightRAG provides an evaluation script based on RAGAS. For detailed information, see RAGAS-based Evaluation Framework.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Advanced Features

Multimodal Document Processing (RAG-Anything Integration)

Quick Start

Token Usage Tracking

Data Export Functions

Cache Management

Langfuse Observability Integration

Installation

Configuration

Features

RAGAS-based Evaluation

FilesExpand file tree

AdvancedFeatures.md

Latest commit

History

AdvancedFeatures.md

File metadata and controls

Advanced Features

Multimodal Document Processing (RAG-Anything Integration)

Quick Start

Token Usage Tracking

Data Export Functions

Cache Management

Langfuse Observability Integration

Installation

Configuration

Features

RAGAS-based Evaluation