GitHub - trustgraph-ai/trustgraph: The Knowledge Platform - expedite delivery of your knowledge to AI. Build, ship, and manage anywhere from local, cloud, on-prem, or edge.

The Knowledge Platform for AI

📑 Full Docs 📺 YouTube 🔧 Configuration Builder ⚙️ API Docs 🧑‍💻 CLI Docs 💬 Discord 📖 Blog

TrustGraph streamlines the delivery and management of knowledge to AI, acting as a comprehensive knowledge platform for your containerized AI tools, pipelines, and integrations.

Deploying state-of-the-art AI requires managing a complex web of models, frameworks, data pipelines, and monitoring tools. TrustGraph simplifies this complexity by providing a unified, open-source platform to configure, build, and ship a complete knowledge solution anywhere you need it – from cloud, on-prem, or edge devices.

Table of Contents

🎯 Why TrustGraph?
🚀 Getting Started
🔧 Configuration Builder
🔎 TrustRAG
🧠 Knowledge Cores
📐 Architecture
🧩 Integrations
📊 Observability & Telemetry
🤝 Contributing
📄 License
📞 Support & Community

🎯 Why TrustGraph?

Unified Knowledge: Define and deploy complete knowledge environments, including models, dependencies, and tooling, as a single, manageable unit.
No-code TrustRAG Pipelines: Deploy full end-to-end RAG pipelines using unique TrustGraph algorithms leveraging both Knowledge graphs and VectorDBs.
Environment-Agnostic Deployment: Provision consistently across diverse infrastructures (Cloud, On-Prem, Edge, Dev environments). Build once, provision anywhere.
Trusted & Secure Delivery: Focuses on providing a secure supply chain for AI components.
Simplified Operations: Radically reduce the complexity and time required to stand up and manage sophisticated AI stacks. Get operational faster.
Open Source & Extensible: Built with transparency and community collaboration in mind. Easily inspect, modify, and extend the platform to meet your specific provisioning needs.
Component Flexibility: Avoid component lock-in. TrustGraph integrates multiple options for all system components.

Developer APIs and CLI

See the API Developer's Guide for more information.

For users, TrustGraph has the following interfaces:

Configuration Builder
Test Suite

The trustgraph-cli installs the commands for interacting with TrustGraph while running along with the Python SDK. The Configuration Builder enables customization of TrustGraph deployments prior to launching. The REST API can be accessed through port 8088 of the TrustGraph host machine with JSON request and response bodies.

Install the TrustGraph CLI

pip3 install trustgraph-cli==<trustgraph-version>

Caution

The trustgraph-cli version must match the selected TrustGraph release version.

🔧 Configuration Builder

TrustGraph is endlessly customizable by editing the YAML resource files. The Configuration Builder provides a tool for building a custom configuration that deploys with your selected orchestration method in your target environment.

Configuration Builder 🚀

The Configuration Builder has 5 important sections:

🚢 TrustGraph Version: Select the version of TrustGraph you'd like to deploy
✅ Component Selection: Choose from the available deployment platforms, LLMs, graph store, VectorDB, chunking algorithm, chunking parameters, and LLM parameters
🧰 Customization: Customize the prompts for the LLM System, Data Extraction Agents, and Agent Flow
🕵️ Test Suite: Add the Test Suite to the configuration available on port 8888
🚀 Finish Deployment: Download the launch YAML files with deployment instructions

The Configuration Builder will generate the YAML files in deploy.zip. Once deploy.zip has been downloaded and unzipped, launching TrustGraph is as simple as navigating to the deploy directory and running:

docker compose up -d

Tip

Docker is the recommended container orchestration platform for first getting started with TrustGraph.

When finished, shutting down TrustGraph is as simple as:

docker compose down -v

Platform Restarts

The -v flag will destroy all data on shut down. To restart the system, it's necessary to keep the volumes. To keep the volumes, shut down without the -v flag:

docker compose down

With the volumes preserved, restarting the system is as simple as:

docker compose up -d

All data previously in TrustGraph will be saved and usable on restart.

Test Suite

If added to the build in the Configuration Builder, the Test Suite will be available at port 8888. The Test Suite has the following capabilities:

Graph RAG Chat 💬: Graph RAG queries in a chat interface
Vector Search 🔎: Semantic similarity search with cosine similarity scores
Semantic Relationships 🕵️: See semantic relationships in a list structure
Graph Visualizer 🌐: Visualize semantic relationships in 3D
Data Loader 📂: Directly load .pdf, .txt, or .md into the system with document metadata

Example TrustGraph Notebooks

TrustGraph is fully containerized and is launched with a YAML configuration file. Unzipping the deploy.zip will add the deploy directory with the following subdirectories:

docker-compose
minikube-k8s
gcp-k8s

Note

As more integrations have been added, the number of possible combinations of configurations has become quite large. It is recommended to use the Configuration Builder to build your deployment configuration. Each directory contains YAML configuration files for the default component selections.

Docker:

docker compose -f <launch-file.yaml> up -d

Kubernetes:

kubectl apply -f <launch-file.yaml>

TrustGraph is designed to be modular to support as many LLMs and environments as possible. A natural fit for a modular architecture is to decompose functions into a set of modules connected through a pub/sub backbone. Apache Pulsar serves as this pub/sub backbone. Pulsar acts as the data broker managing data processing queues connected to procesing modules.

🔎 TrustRAG

TrustGraph incorporates TrustRAG, an advanced RAG approach that leverages automatically constructed Knowledge Graphs to provide richer and more accurate context to LLMs. Instead of relying solely on unstructured text chunks, TrustRAG understands and utilizes the relationships between pieces of information.

How TrustRAG Works:

Automated Knowledge Graph Construction:
- TrustGraph processes source data to automatically extract key entities, topics, and the relationships connecting them.
- It then maps these extracted semantic relationships and concepts to high-dimensional vector embeddings, capturing the nuanced meaning beyond simple keyword matching.
Hybrid Retrieval Process:
- When a query is received, TrustRAG first performs a cosine similarity search on the vector embeddings to identify potentially relevant concepts and relationships within the knowledge graph.
- This initial vector search pinpoints relevant entry points within the structured Knowledge Graph.
Context Generation via Subgraph Traversal:
- Based on the ranked results from the similarity search, TrustRAG dynamically generates relevant subgraphs.
- It starts from the identified entry points and traverses the connections within the Knowledge Graph. Users can configure the number of 'hops' (relationship traversals) to expand the contextual window, gathering interconnected information.
- This structured subgraph, containing entities and their relationships, forms a highly relevant and context-aware input prompt for the LLM that is endlessly configurable with options for the number of entities, relationships, and overall subgraph size.

🧠 Knowledge Cores

One of the biggest challenges currently facing RAG architectures is the ability to quickly reuse and integrate knowledge sets. TrustGraph solves this problem by storing the results of the document ingestion process in reusable Knowledge Cores. Being able to store and reuse the Knowledge Cores means the process has to be run only once for a set of documents. These reusable Knowledge Cores can be loaded back into TrustGraph and used for TrustRAG.

A Knowledge Core has two components:

Set of Graph Edges
Set of mapped Vector Embeddings

When a Knowledge Core is loaded into TrustGraph, the corresponding graph edges and vector embeddings are queued and loaded into the chosen graph and vector stores.

📐 Architecture

As a full-stack platform, TrustGraph provides all the stack layers needed to connect the data layer to the app layer for autonomous operations.

🧩 Integrations

TrustGraph seamlessly integrates API services, data stores, observability, telemetry, and control flow for a unified platform experience.

LLM Providers: Anthropic, AWS Bedrock, AzureAI, AzureOpenAI, Cohere, Google AI Studio, Google VertexAI, Llamafiles, LM Studio, Mistral, Ollama, and OpenAI
Vector Databases: Qdrant, Pinecone, and Milvus
Knowledge Graphs: Memgraph, Neo4j, and FalkorDB
Data Stores: Apache Cassandra
Observability: Prometheus and Grafana
Control Flow: Apache Pulsar

Pulsar Control Flows

For control flows, Pulsar accepts the output of a processing module and queues it for input to the next subscribed module.
For services such as LLMs and embeddings, Pulsar provides a client/server model. A Pulsar queue is used as the input to the service. When processed, the output is then delivered to a separate queue where a client subscriber can request that output.

Document Extraction Agents

TrustGraph extracts knowledge documents to an ultra-dense knowledge graph using 3 automonous data extraction agents. These agents focus on individual elements needed to build the knowledge graph. The agents are:

Topic Extraction Agent
Entity Extraction Agent
Relationship Extraction Agent

The agent prompts are built through templates, enabling customized data extraction agents for a specific use case. The data extraction agents are launched automatically with the loader commands.

PDF file:

tg-load-pdf <document.pdf>

Text or Markdown file:

tg-load-text <document.txt>

Graph RAG Queries

Once the knowledge graph and embeddings have been built or a cognitive core has been loaded, RAG queries are launched with a single line:

tg-invoke-graph-rag -q "What are the top 3 takeaways from the document?"

Agent Flow

Invoking the Agent Flow will use a ReAct style approach the combines Graph RAG and text completion requests to think through a problem solution.

tg-invoke-agent -v -q "Write a blog post on the top 3 takeaways from the document."

Tip

Adding -v to the agent request will return all of the agent manager's thoughts and observations that led to the final response.

📊 Observability & Telemetry

Once the platform is running, access the Grafana dashboard at:

http://localhost:3000

Default credentials are:

user: admin
password: admin

The default Grafana dashboard tracks the following:

LLM Latency
Error Rate
Service Request Rates
Queue Backlogs
Chunking Histogram
Error Source by Service
Rate Limit Events
CPU usage by Service
Memory usage by Service
Models Deployed
Token Throughput (Tokens/second)
Cost Throughput (Cost/second)

🤝 Contributing

Developing for TrustGraph

📄 License

TrustGraph is licensed under Apache 2.0.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

📞 Support & Community

Bug Reports & Feature Requests: Discord
Discussions & Questions: Discord
Documentation: Docs

Name		Name	Last commit message	Last commit date
Latest commit History 729 Commits
.github/workflows		.github/workflows
containers		containers
docs		docs
grafana		grafana
prometheus		prometheus
test-api		test-api
tests		tests
trustgraph-base		trustgraph-base
trustgraph-bedrock		trustgraph-bedrock
trustgraph-cli		trustgraph-cli
trustgraph-embeddings-hf		trustgraph-embeddings-hf
trustgraph-flow		trustgraph-flow
trustgraph-ocr		trustgraph-ocr
trustgraph-vertexai		trustgraph-vertexai
trustgraph		trustgraph
.gitignore		.gitignore
Containerfile		Containerfile
DEVELOPER_GUIDE.md		DEVELOPER_GUIDE.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
TG-future-agents.svg		TG-future-agents.svg
TG-future-horizon.svg		TG-future-horizon.svg
TG-horizon-repo.svg		TG-horizon-repo.svg
TG-layer-diagram.svg		TG-layer-diagram.svg
TG-ship.jpg		TG-ship.jpg
TG_layer_diagram.svg		TG_layer_diagram.svg
requirements.txt		requirements.txt
schema.ttl		schema.ttl
tg-arch-diagram.svg		tg-arch-diagram.svg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Knowledge Platform for AI

🎯 Why TrustGraph?

🚀 Getting Started

Developer APIs and CLI

Install the TrustGraph CLI

🔧 Configuration Builder

Platform Restarts

Test Suite

Example TrustGraph Notebooks

🔎 TrustRAG

🧠 Knowledge Cores

📐 Architecture

🧩 Integrations

Pulsar Control Flows

Document Extraction Agents

Graph RAG Queries

Agent Flow

📊 Observability & Telemetry

🤝 Contributing

📄 License

📞 Support & Community

About

Contributors 4

Languages

License

trustgraph-ai/trustgraph

Folders and files

Latest commit

History

Repository files navigation

The Knowledge Platform for AI

🎯 Why TrustGraph?

🚀 Getting Started

Developer APIs and CLI

Install the TrustGraph CLI

🔧 Configuration Builder

Platform Restarts

Test Suite

Example TrustGraph Notebooks

🔎 TrustRAG

🧠 Knowledge Cores

📐 Architecture

🧩 Integrations

Pulsar Control Flows

Document Extraction Agents

Graph RAG Queries

Agent Flow

📊 Observability & Telemetry

🤝 Contributing

📄 License

📞 Support & Community

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 4

Languages