
Switch MCP HTTP server to stateless mode #624

Open
DavidDwyer87 wants to merge 46 commits into tobi:main from cicadialabs:session-bug

Conversation

@DavidDwyer87

Summary

  • Convert the MCP HTTP server from stateful session management to stateless mode (sessionIdGenerator: undefined)
  • Each POST /mcp request now creates a fresh McpServer + WebStandardStreamableHTTPServerTransport, handles the request, then cleans up
  • GET/DELETE on /mcp now return 405 (not applicable for stateless)
  • Removed sessions Map, createSession() helper, and related session lifecycle code
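The per-request lifecycle above can be sketched as follows. This is a minimal simulation, not the actual qmd code: FakeMcpServer and FakeTransport are stand-ins for the real McpServer and WebStandardStreamableHTTPServerTransport from the MCP SDK, but the shape of handlePost mirrors the PR's flow: create fresh objects, handle the request, clean up, with no session map anywhere.

```typescript
// Stand-in for McpServer: tracks whether cleanup ran.
class FakeMcpServer {
  closed = false;
  async close() { this.closed = true; }
}

// Stand-in for the stateless streamable HTTP transport.
class FakeTransport {
  closed = false;
  async handleRequest(body: unknown): Promise<string> {
    return `handled ${JSON.stringify(body)}`;
  }
  async close() { this.closed = true; }
}

// One fresh server + transport per POST /mcp; cleanup always runs.
async function handlePost(body: unknown): Promise<string> {
  const server = new FakeMcpServer();
  const transport = new FakeTransport();
  try {
    return await transport.handleRequest(body);
  } finally {
    await transport.close();
    await server.close();
  }
}

// GET/DELETE have no meaning without sessions, so they are rejected.
function handleOtherMethod(): { status: number } {
  return { status: 405 };
}
```

Because nothing survives between requests, a server restart is invisible to clients: there is no session ID to invalidate.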

Why

When the MCP HTTP server restarts (crash, deploy, OOM), all in-memory sessions are lost. Clients holding old session IDs receive persistent "Session not found" errors and cannot recover without restarting their own process. This is especially painful for remote MCP setups (e.g., type: "remote" in opencode config) where the client and server are on different machines.

Changes

  • Removed: sessions Map, createSession(), session routing logic, stale session handling
  • Removed: Unused imports (randomUUID, isInitializeRequest)
  • Added: Fresh McpServer + stateless transport per POST request, with cleanup after response
  • Added: 405 response for GET/DELETE on /mcp endpoint
  • Net: -76 lines

Testing

All verified locally with `node dist/cli/qmd.js mcp --http --port 8181 --daemon`:

  • Tool call without prior initialize → works (no session required)
  • Sequential tool calls → each gets fresh transport, all work
  • GET /mcp → 405 Method Not Allowed
  • Initialize request → returns proper capabilities and instructions
  • Lex search query → returns ranked results
  • Collections tool → lists all collections
  • REST endpoints (/health, /query, /search) → unchanged

David Dwyer and others added 30 commits April 13, 2026 05:48
Implements the LLM interface using the Ollama REST API as an alternative
to the default node-llama-cpp local GGUF inference.

New files:
- src/ollama.ts: OllamaLLM class with embed(), generate(), expandQuery(),
  rerank(), modelExists(), dispose() methods using Ollama REST API
- test/ollama.test.ts: 45 unit tests covering all methods, error handling,
  configuration, and getDefaultLLM() routing

Modified files:
- src/llm.ts: Added getDefaultLLM() function that routes to OllamaLLM
  when QMD_LLM_BACKEND=ollama, otherwise falls back to LlamaCpp

Configuration (env vars):
- QMD_LLM_BACKEND=ollama — enable Ollama backend
- QMD_OLLAMA_BASE_URL — server URL (default: http://localhost:11434)
- QMD_OLLAMA_EMBED_MODEL — embedding model (default: nomic-embed-text)
- QMD_OLLAMA_GENERATE_MODEL — generation model (default: qwen3:1.7b)
- QMD_OLLAMA_RERANK_MODEL — reranking model (default: qwen3:0.6b)

Reranking uses chat-based relevance scoring since Ollama has no native
rerank API. The model outputs relevance scores which are parsed and
normalized to [0, 1].
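The score parsing step can be sketched like this. The helper name and the 0–10 rating scale are illustrative assumptions, not the actual qmd implementation; the commit only states that model-emitted scores are parsed and normalized to [0, 1].

```typescript
// Extract a relevance score from a chat model's free-form reply and
// normalize it into [0, 1], assuming the prompt asked for a 0-10 rating.
function parseRelevanceScore(modelOutput: string): number {
  // Pull the first number out of the reply, e.g. "Score: 7/10" -> 7.
  const match = modelOutput.match(/-?\d+(\.\d+)?/);
  if (!match) return 0; // unparseable reply counts as irrelevant
  const raw = parseFloat(match[0]);
  // Clamp outliers so downstream ranking always sees [0, 1].
  return Math.min(1, Math.max(0, raw / 10));
}
```

Clamping matters because small models occasionally rate outside the requested scale or echo unrelated numbers.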

Also verified MCP server works via both stdio and HTTP transports with
all 4 tools (query, get, multi_get, status) accessible.

Adds 9 new tools to the QMD MCP server for full programmatic control:

Collection Management:
- collections: list all collections with stats
- add_collection: add a new collection (name, path, pattern, ignore)
- remove_collection: remove a collection by name
- rename_collection: rename a collection

Context Management:
- contexts: list all contexts + global context
- add_context: add context to a collection path
- remove_context: remove context from a collection path

Indexing:
- update_index: re-index collections from filesystem
- embed: generate vector embeddings for documents

Previously the MCP server only exposed 4 read-only tools (query, get,
multi_get, status). Now agents can manage collections, set context, and
trigger indexing operations remotely via MCP.

Test coverage: 20 unit tests covering all new tools.

Combined with the existing 4 tools, QMD now exposes 13 MCP tools total.
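The resulting tool surface can be sketched with a minimal registry. The register() helper is a stand-in for however qmd actually wires handlers into its McpServer; only the tool names come from the commit message above.

```typescript
type ToolHandler = (args: Record<string, unknown>) => unknown;

const tools = new Map<string, ToolHandler>();
function register(name: string, handler: ToolHandler) {
  tools.set(name, handler);
}

// Existing read-only tools.
for (const name of ["query", "get", "multi_get", "status"]) {
  register(name, () => ({ tool: name }));
}

// New management tools added by this commit.
for (const name of [
  "collections", "add_collection", "remove_collection", "rename_collection",
  "contexts", "add_context", "remove_context",
  "update_index", "embed",
]) {
  register(name, () => ({ tool: name }));
}

// 4 existing + 9 new = 13 tools total.
```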
feat: add Ollama LLM provider for remote model inference
feat: add MCP management tools for collections, contexts, and indexing
…llama

When QMD_LLM_BACKEND=ollama is set, QMD no longer downloads or loads
local GGUF models. The store layer now uses the generic LLM interface
throughout, routing to OllamaLLM when configured.

Changes to src/store.ts:
- getLlm() returns LLM interface instead of LlamaCpp concrete type
- Store.llm field typed as LLM instead of LlamaCpp
- All llmOverride params typed as LLM instead of LlamaCpp
- All getDefaultLlamaCpp() calls replaced with getDefaultLLM()
- chunkDocumentByTokens() uses instanceof guard for backend-specific logic

Changes to src/llm.ts:
- Added intent to ExpandQueryOptions in LLM interface
- Added embedBatch() and embedModelName to LLM interface
- Added SimpleLLMSession for non-LlamaCpp backends
- Updated withLLMSessionForLlm() to accept LLM interface

Changes to src/ollama.ts:
- Added embedModelName getter and embedBatch() method
- Renamed private fields to avoid interface naming conflicts

Changes to test/ollama.test.ts:
- Fixed field name references after private field rename
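The backend routing described above can be sketched as a simple env-var switch. The LLM interface here is trimmed to one method for illustration, and the env parameter is a testing convenience (the real getDefaultLLM() presumably reads process.env directly); only the QMD_LLM_BACKEND=ollama condition comes from the commit.

```typescript
// Trimmed stand-in for the shared LLM interface.
interface LLM {
  backendName(): string;
}

class OllamaLLM implements LLM {
  backendName() { return "ollama"; }
}

class LlamaCppLLM implements LLM {
  backendName() { return "llama-cpp"; }
}

// Route to Ollama when QMD_LLM_BACKEND=ollama, else fall back to
// local llama.cpp inference (and its GGUF downloads).
function getDefaultLLM(env: Record<string, string | undefined>): LLM {
  return env.QMD_LLM_BACKEND === "ollama"
    ? new OllamaLLM()
    : new LlamaCppLLM();
}
```

Typing the store layer against the interface rather than the concrete LlamaCpp class is what lets this switch happen in one place instead of at every call site.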
…tion

refactor: use LLM interface in store.ts to skip GGUF downloads with Ollama
- /health now returns total indexed documents and docs needing embedding
- Update test to verify new fields
- Add Jenkinsfile for CI/CD pipeline