feat: saturn query pipeline for rag optimisation #23

yekkhan-liftoff · 2025-10-31T14:14:02Z

Summary

Enhanced RAG pipeline with query enhancement and voyage embedding. Query enhancement happens at Slack client layer for all queries, embeddings generated via Voyage AI, and S3 vectors used for similarity search.

Flow

Stage 1: Query Enhancement (Slack Layer)

User Query: "What were Q3 2025 revenues for VX in APAC?"
↓
Slack Client → Query Enhancer (Claude Sonnet)
↓
EnhanceQuery(query, today="2025-11-03")
↓
Analyze: temporal query detected
↓
Output: {
enhancedQuery: "Q3 2025 VX revenues APAC",
metadata: { generatedDate: "2025-09-30" }
}
↓
Date Range Expanded: 2025-09-23 to 2025-09-30 (7-day window)

Stage 2: LLM Tool Detection

Slack Client → Main LLM (GPT-4.1)
↓
CallLLM(enhancedQuery, systemPrompt)
↓
LLM detects: needs rag_search tool
↓
Output: ToolCall {
name: "rag_search",
args: { query: "Q3 2025 VX revenues APAC" }
}

Stage 3: RAG Search with Embeddings

Slack Client → LLM-MCP Bridge
↓
ProcessLLMResponse(extraArgs: {metadata})
↓
Bridge → RAG Client
↓
CallTool("rag_search", args + metadata)
↓
RAG extracts metadata from args
↓
Build SearchOptions {
limit: 30,
dateFilter: ["2025-09-23"..."2025-09-30"]
}
↓
RAG Client → Voyage AI Embedding Provider
↓
POST /v1/embeddings
model: voyage-context-3
dimensions: 1024
↓
Output: queryVector [1024 floats]
↓
RAG Client → AWS S3 Vectors
↓
QueryVectors {
vector: queryVector,
topK: 7,
filter: { report_generated_date: {$in: dates} }
}
↓
S3: Cosine similarity search + metadata filtering
↓
Output: 7 results (scored & sorted)
↓
RAG: sortResultsByDate() - newest first
↓
Format: "Found 7 contexts...
--- Context 1 ---
Source: revenue_report.pdf
Date: 2025-09-30
Content: ..."

Stage 4: Final Synthesis

Bridge → Main LLM
↓
Re-prompt {
query: enhancedQuery,
context: RAG results
}
↓
LLM synthesizes answer from context
↓
Output: "Q3 2025 VX revenues in APAC were..."
↓
Bridge → Slack Client → User

wyangsun · 2025-11-03T09:10:10Z

Could we use Langchain Go's embedding framework, it support Voyage embedding model.
https://github.com/tmc/langchaingo/blob/main/embeddings/voyageai/voyageai.go
https://github.com/tmc/langchaingo/blob/main/embeddings/embedding.go

yekkhan-liftoff · 2025-11-03T09:24:00Z

Could we use Langchain Go's embedding framework, it support Voyage embedding model. https://github.com/tmc/langchaingo/blob/main/embeddings/voyageai/voyageai.go https://github.com/tmc/langchaingo/blob/main/embeddings/embedding.go

the model that i am using is not supported yet

internal/config/config.go

internal/rag/s3_provider.go

internal/handlers/llm_mcp_bridge.go

internal/rag/s3_provider.go

internal/rag/client.go

yekkhan-liftoff · 2025-11-10T07:17:27Z

todo: inject query enhancement prompt instead of hardcoding in application

…-rag-optimisation

tommynguyen-vungle

LGTM

* feat: saturn query pipeline for rag optimisation * feat: remove hardcoded limit * feat: remove unused metadata * feat: todo comments * feat: todo comments * feat: decouple query rewriting and rag search * chore: remove unused comments * fix: fix missing s3 config, make embedding model configurable * feat: debug voyage api key * feat: substitute rag embedding provider env var, remove debug log * feat: add IRSA support to service account template * feat: add logs * feat: add observability for query enhancement * fix: fix empty input and tool name for tool-execution span * feat: added embedding span and fixed incorrect token usage * feat: vector search span * feat: make date filter field configurable * feat: let llm handles the date window * feat: inject query enhancement prompt * feat: handle corrupted metadata * fix: fix race condition in S3Provider.Initialize() * perf(rag): optimize result sorting from O(n²) to O(n log n) * fix: sort dates in descending order, better for LLM * fix: fix test * fix: fix golangci lint err * fix: remove redundant metadata filtering * refactor: dates filter are stored as int * refactor: dates filter are stored as int * fix: fix lint

* PE-7777: Claude Sonnet 4.5 integration * Support thinking for Claude Sonnet 4.5 * include Thinking Output In Response * Fixed thinking messaged deletion to get thread replies (tuannvm#143) Signed-off-by: rangamani54 <[email protected]> * ci(cursor): Add Cursor automated code-review workflow Signed-off-by: Tommy Nguyen <[email protected]> * feat: saturn query pipeline for rag optimisation (#23) * feat: saturn query pipeline for rag optimisation * feat: remove hardcoded limit * feat: remove unused metadata * feat: todo comments * feat: todo comments * feat: decouple query rewriting and rag search * chore: remove unused comments * fix: fix missing s3 config, make embedding model configurable * feat: debug voyage api key * feat: substitute rag embedding provider env var, remove debug log * feat: add IRSA support to service account template * feat: add logs * feat: add observability for query enhancement * fix: fix empty input and tool name for tool-execution span * feat: added embedding span and fixed incorrect token usage * feat: vector search span * feat: make date filter field configurable * feat: let llm handles the date window * feat: inject query enhancement prompt * feat: handle corrupted metadata * fix: fix race condition in S3Provider.Initialize() * perf(rag): optimize result sorting from O(n²) to O(n log n) * fix: sort dates in descending order, better for LLM * fix: fix test * fix: fix golangci lint err * fix: remove redundant metadata filtering * refactor: dates filter are stored as int * refactor: dates filter are stored as int * fix: fix lint * update CLAUDE.md --------- Signed-off-by: rangamani54 <[email protected]> Signed-off-by: Tommy Nguyen <[email protected]> Co-authored-by: Ranga Mani Kumar <[email protected]> Co-authored-by: Tommy Nguyen <[email protected]> Co-authored-by: yekkhan-liftoff <[email protected]>

yekkhan-liftoff added 7 commits October 31, 2025 19:32

feat: saturn query pipeline for rag optimisation

0c052c6

feat: remove hardcoded limit

b6c686f

feat: remove unused metadata

afc52a2

feat: todo comments

55816b3

feat: todo comments

2a96ae1

feat: decouple query rewriting and rag search

405e825

chore: remove unused comments

b0f9f39

yekkhan-liftoff added 9 commits November 3, 2025 18:03

fix: fix missing s3 config, make embedding model configurable

4d8cb9e

feat: debug voyage api key

bd49836

feat: substitute rag embedding provider env var, remove debug log

4999090

feat: add IRSA support to service account template

f0fb4d0

feat: add logs

8d5aaa4

feat: add observability for query enhancement

4756a4c

fix: fix empty input and tool name for tool-execution span

540f49e

feat: added embedding span and fixed incorrect token usage

a6cc1e8

feat: vector search span

de6090d

tommynguyen-vungle reviewed Nov 6, 2025

View reviewed changes

internal/config/config.go Show resolved Hide resolved

tuannvm reviewed Nov 6, 2025

View reviewed changes

internal/rag/s3_provider.go Show resolved Hide resolved

tommynguyen-vungle reviewed Nov 6, 2025

View reviewed changes

internal/handlers/llm_mcp_bridge.go Show resolved Hide resolved

tommynguyen-vungle reviewed Nov 6, 2025

View reviewed changes

internal/rag/s3_provider.go Show resolved Hide resolved

tommynguyen-vungle reviewed Nov 6, 2025

View reviewed changes

internal/rag/client.go Outdated Show resolved Hide resolved

yekkhan-liftoff added 2 commits November 10, 2025 12:24

feat: make date filter field configurable

17b2ac7

feat: let llm handles the date window

810cb24

yekkhan-liftoff added 4 commits November 10, 2025 15:40

feat: inject query enhancement prompt

cb8f858

feat: handle corrupted metadata

e679bef

fix: fix race condition in S3Provider.Initialize()

e49ae77

perf(rag): optimize result sorting from O(n²) to O(n log n)

95a82d5

yekkhan-liftoff added 4 commits November 10, 2025 19:31

fix: sort dates in descending order, better for LLM

4fe2c08

fix: fix test

a356c12

fix: fix golangci lint err

b2cf01a

Merge branch 'refs/heads/main' into PE-7705-saturn-query-pipeline-for…

8edefb3

…-rag-optimisation

tommynguyen-vungle approved these changes Nov 12, 2025

View reviewed changes

yekkhan-liftoff added 4 commits November 14, 2025 20:19

fix: remove redundant metadata filtering

cc21015

refactor: dates filter are stored as int

ee61226

refactor: dates filter are stored as int

fa192ab

fix: fix lint

9177784

yekkhan-liftoff merged commit 3cf3966 into main Nov 24, 2025
6 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: saturn query pipeline for rag optimisation #23

feat: saturn query pipeline for rag optimisation #23

Uh oh!

yekkhan-liftoff commented Oct 31, 2025 •

edited

Loading

Uh oh!

wyangsun commented Nov 3, 2025 •

edited

Loading

Uh oh!

yekkhan-liftoff commented Nov 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yekkhan-liftoff commented Nov 10, 2025

Uh oh!

tommynguyen-vungle left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: saturn query pipeline for rag optimisation #23

feat: saturn query pipeline for rag optimisation #23

Uh oh!

Conversation

yekkhan-liftoff commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Flow

Stage 1: Query Enhancement (Slack Layer)

Stage 2: LLM Tool Detection

Stage 3: RAG Search with Embeddings

Stage 4: Final Synthesis

Uh oh!

wyangsun commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yekkhan-liftoff commented Nov 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yekkhan-liftoff commented Nov 10, 2025

Uh oh!

tommynguyen-vungle left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yekkhan-liftoff commented Oct 31, 2025 •

edited

Loading

wyangsun commented Nov 3, 2025 •

edited

Loading