flamehaven01
diff --git a/.gitignore b/.gitignore
Lines changed: 14 additions & 1 deletion
diff --git a/CHANGELOG.md b/CHANGELOG.md
Lines changed: 98 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 22 additions & 13 deletions
diff --git a/ROADMAP.md b/ROADMAP.md
Lines changed: 12 additions & 0 deletions
diff --git a/docs/wiki/Architecture.md b/docs/wiki/Architecture.md
Lines changed: 72 additions & 8 deletions
@@ -156,6 +156,9 @@ doc_sanity_report*.json
 DOC_SANITY_FIXES.md
 PHASE*_COMPLETION_SUMMARY.md
 
+# Historical development artifacts (internal audit reports, design docs — not public docs)
+docs/history/
+
 # Test data
 test_files/
 sample_data/
@@ -174,6 +177,16 @@ logs/
 *.sqlite
 *.sqlite3
 
-# SIDRCE reports (internal quality metrics)
+# SIDRCE reports and slop-detector outputs (internal quality metrics)
 sidrce_report*.json
+sidrce_cert*.yaml
+certification_report*.yaml
 quality_report*.json
+slop_*.json
+slop_report*.json
+
+# Scratch / test artefacts
+test_doc.txt
+
+# CR-EP internal config
+.cr-ep/
@@ -7,6 +7,104 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ---
 
+## [1.6.1] - 2026-04-19
+
+### Refactored
+
+- **API orchestration** (`api.py`): `initialize_services` (66 lines, CC~8) →
+  `_init_searcher` + `_init_cache` + `_init_metrics` + 8-line orchestrator.
+  `_record_upload_failure` extracted — eliminates 2× duplicated
+  `record_file_upload + record_error` blocks in `upload_single_file`.
+
+- **Admin auth** (`admin_routes.py`): `_get_admin_user` (77 lines, CC~10) →
+  `_parse_bearer_token` + `_try_oauth_admin` + `_resolve_key_admin` + 5-line
+  orchestrator. Fixes `reverse_field_glyphs` rebuilt on every recursive call.
+
+- **Engine** (`engine/chronos_grid.py`): `seek_vector_resonance` (80 lines,
+  2 code paths) → `_hnsw_vector_resonance` + `_brute_vector_resonance` +
+  10-line dispatcher (HNSW path vs brute-force cosine similarity).
+
+- **Engine** (`engine/gravitas_pack.py`): `_compress_dict` / `_decompress_dict`
+  clone cluster → `_transform_dict(obj, key_map, value_transform)` dispatch table.
+  Both callers become 2-line delegators.
+
+### Changed
+
+- **`eval_self.py`**: `CORPUS_FILES` split into `AUDIT_CORPUS` (11 docs) +
+  `SOURCE_CORPUS` (7 source files). `CORPUS_FILES = AUDIT_CORPUS + SOURCE_CORPUS`
+  preserves existing full-pack behaviour; `AUDIT_CORPUS` alone enables lightweight
+  doc-quality runs.
+
+- **`.gitignore`**: `docs/history/` added under "Historical development artifacts".
+
+### Tests
+
+- 475 passed, 13 skipped — same count as v1.6.0 (1 pre-existing flaky timing test
+  in full suite; passes in isolation).
+
+---
+
+## [1.6.0] - 2026-04-19
+
+### Added
+
+- **BM25 + RRF Hybrid Search** (`engine/hybrid_search.py`): Production-grade BM25
+  (k1=1.5, b=0.75) with Korean+English tokenizer
+  (`re.findall(r"[a-z0-9\uac00-\ud7a3]+", text.lower())`).
+  Reciprocal Rank Fusion merges BM25 and ChronosGrid semantic lists using
+  string URI as doc ID — no integer alignment required. k=60, top_k configurable.
+  Lazy per-store index with `_bm25_dirty` set: index rebuilt on first hybrid
+  search after any upload, not on every upload.
+
+- **KnowledgeAtom chunk-level indexing** (`engine/knowledge_atom.py`): Two-level
+  indexing — file-level doc + chunk atoms with fragment URIs
+  (`local://store/enc_path#c0001`). `chunk_and_inject()` splits content into
+  800-char overlapping windows (120-char overlap, 80-char minimum), embeds each
+  chunk via `embedding_generator.generate()`, injects into ChronosGrid, and
+  registers in `_atom_store_docs` for URI-based resolution. Enables precision
+  chunk-level retrieval alongside file-level documents.
+
+- **Stable URI scheme**: Local documents now use
+  `local://<store>/<urllib.parse.quote(abs_path, safe='')>` instead of
+  `local://<store>/<basename>`. Eliminates collisions when files with identical
+  names exist in different directories. URIs are reversible via `unquote()`.
+  Both main docs and chunk atoms share the same URI namespace.
+
+### Refactored
+
+- **`core.py` segmentation** (1258 → 221 lines): `FlamehavenFileSearch` split into
+  three focused mixin classes: `IngestMixin`, `LocalSearchMixin`, and
+  `CloudSearchMixin`. `core.py` is now a thin orchestrator: `__init__`,
+  `create_store`, `list_stores`, `delete_store`, `get_metrics`,
+  `_resolve_vector_backend`.
+
+  | Mixin | File | Responsibility |
+  |---|---|---|
+  | `IngestMixin` | `_ingest.py` (228 L) | upload_file, upload_files, _local_upload, _generate_file_vector |
+  | `LocalSearchMixin` | `_search_local.py` (273 L) | _local_search, BM25 rebuild, hybrid rerank, RAG prompt |
+  | `CloudSearchMixin` | `_search_cloud.py` (265 L) | search, search_stream, search_multimodal + 6 shared helpers |
+
+- **Duplicate helper elimination** (`_search_cloud.py`): Six blocks that were
+  copy-pasted between `search()` and `search_multimodal()` are now shared helpers:
+  `_resolve_search_params`, `_ensure_store`, `_query_vector_backend`,
+  `_driftlock_validate`, `_extract_grounding_sources`, `_gemini_search_call`.
+
+### Fixed
+
+- **`search_stream` double intent-refine bug**: `intent_refiner.refine_intent(query)`
+  was called twice (lines 984 and 988 in old `core.py`) — once before the
+  provider-RAG branch and once inside it. The second call discarded the first
+  `optimized_query`. Fixed: single call, result reused throughout the method.
+
+### Tests
+
+- 443 tests pass, 13 skipped — no regression from refactor.
+- `test_flamehaven_remote_client_flow` patch target updated: also patches
+  `flamehaven_filesearch._search_cloud._google_genai_types` after types moved
+  from `core.py` to `_search_cloud.py`.
+
+---
+
 ## [1.5.3] - 2026-04-19
 
 ### Added
 
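Review note on the stable URI scheme in the 1.6.0 entry: the sketch below illustrates why encoding the full absolute path avoids basename collisions and how the path can be recovered. The helper names `build_doc_uri`, `build_atom_uri`, and `path_from_uri` are hypothetical and do not come from the codebase. Only the `local://<store>/<quote(abs_path, safe='')>` shape and the `#c0001` fragment form are taken from the changelog, and whether atom numbering starts at 0 or 1 is an assumption.

```python
from urllib.parse import quote, unquote


def build_doc_uri(store: str, abs_path: str) -> str:
    # Hypothetical helper: percent-encode the whole path, including "/",
    # so the encoded path is a single URI segment.
    return f"local://{store}/{quote(abs_path, safe='')}"


def build_atom_uri(doc_uri: str, index: int) -> str:
    # Hypothetical helper: the fragment format mirrors the changelog example.
    return f"{doc_uri}#c{index:04d}"


def path_from_uri(uri: str) -> str:
    # Drop any chunk fragment, then decode the final segment back to a path.
    # A literal "#" in the path is encoded as %23, so the split is safe.
    base = uri.split("#", 1)[0]
    encoded = base.rsplit("/", 1)[1]
    return unquote(encoded)


uri_a = build_doc_uri("docs", "/data/a/report.pdf")
uri_b = build_doc_uri("docs", "/data/b/report.pdf")
print(uri_a)
print(uri_b)
print(path_from_uri(build_atom_uri(uri_a, 1)))
```

Two files named `report.pdf` in different directories get distinct URIs here, which a basename-only scheme could not guarantee.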
@@ -7,7 +7,7 @@
 ### Self-hosted RAG search engine. Production-ready in 3 minutes.
 
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
-[![Version](https://img.shields.io/badge/version-1.5.3-blue.svg)](CHANGELOG.md)
+[![Version](https://img.shields.io/badge/version-1.6.1-blue.svg)](CHANGELOG.md)
 [![Python](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/)
 [![Docker](https://img.shields.io/badge/docker-ready-brightgreen.svg)](https://hub.docker.com/r/flamehaven/filesearch)
 
@@ -17,13 +17,13 @@
 
 ---
 
-## 🎯 Why FLAMEHAVEN?
+## 🎯 Why FLAMEHAVEN FileSearch?
 
-Stop sending your sensitive documents to third-party services. Get enterprise-grade semantic search running locally in minutes, not days.
+Stop sending your sensitive documents to third-party services. FLAMEHAVEN FileSearch is a production-grade RAG search engine — BM25+hybrid retrieval, 34 file formats, multi-LLM (Gemini, OpenAI, Claude, Ollama) — running self-hosted in minutes, not days.
 
 ```bash
 # One command. Three minutes. Done.
-docker run -d -p 8000:8000 -e GEMINI_API_KEY="your_key" flamehaven-filesearch:1.5.2
+docker run -d -p 8000:8000 -e GEMINI_API_KEY="your_key" flamehaven-filesearch:1.6.1
 ```
 
 <table>
@@ -57,9 +57,9 @@ Open source & MIT licensed</p>
 
 | Capability | Detail |
 |---|---|
-| **Search Modes** | Keyword, semantic, and hybrid with automatic typo correction |
+| **Search Modes** | Keyword, semantic, and hybrid (BM25+RRF) with automatic typo correction |
 | **34 File Formats** | PDF, DOCX/DOC, XLSX, PPTX, RTF, HTML, CSV, LaTeX, WebVTT, images + plain text — see [Document Parsing](docs/wiki/Document_Parsing.md) |
-| **RAG Pipeline** | Structure-aware chunking, sliding-window context enrichment, mtime parse cache |
+| **RAG Pipeline** | Structure-aware chunking, KnowledgeAtom 2-level indexing, sliding-window context enrichment, mtime parse cache |
 | **Ultra-Fast Vectors** | DSP v2.0 generates embeddings in <1ms — no ML frameworks required |
 | **Source Attribution** | Every answer links back to the originating document and chunk |
 | **Framework SDKs** | LangChain, LlamaIndex, Haystack, CrewAI adapters out of the box |
@@ -83,7 +83,7 @@ docker run -d \
   -e GEMINI_API_KEY="your_gemini_api_key" \
   -e FLAMEHAVEN_ADMIN_KEY="secure_admin_password" \
   -v $(pwd)/data:/app/data \
-  flamehaven-filesearch:1.5.2
+  flamehaven-filesearch:1.6.1
 ```
 
 ✅ Server running at `http://localhost:8000`
@@ -167,7 +167,7 @@ pip install flamehaven-filesearch[all]
 # Build from source
 git clone https://github.com/flamehaven01/Flamehaven-Filesearch.git
 cd Flamehaven-Filesearch
-docker build -t flamehaven-filesearch:1.5.2 .
+docker build -t flamehaven-filesearch:1.6.1 .
 ```
 
 ### Framework Integrations
@@ -259,7 +259,7 @@ security:
 </tr>
 <tr>
 <td>Test Suite</td>
-<td><code>443 tests</code></td>
+<td><code>476 tests</code></td>
 <td>All passing (pytest)</td>
 </tr>
 <tr>
@@ -299,8 +299,9 @@ flowchart TD
     subgraph Engine["Engine Layer"]
         FP["FileParser\n+ BackendRegistry\n(34 formats)"]
         Cache["ParseCache\n(mtime-based)"]
-        Chunker["TextChunker\n+ ContextExtractor"]
+        Chunker["TextChunker\n+ KnowledgeAtom\n(chunk atoms)"]
         DSP["DSP v2.0\nEmbedding Generator\n(&lt;1ms, zero-ML)"]
+        BM25["BM25 + RRF\nHybrid Search\n(v1.6.0)"]
         Scorer["SemanticScorer\n+ TypoCorrector"]
     end
 
@@ -383,7 +384,14 @@ Full roadmap: [ROADMAP.md](ROADMAP.md)
 - [x] Backend Plugin Architecture — `AbstractFormatBackend` + `BackendRegistry` (v1.5.2)
 - [x] Parse cache — mtime-based, `extract_text(use_cache=True)` (v1.5.2)
 - [x] ContextExtractor — sliding-window RAG chunk enrichment (v1.5.2)
-- [x] 443 tests; AI-Slop-Detector critical deficits: 0 (v1.5.2)
+- [x] Multi-provider LLM support — OpenAI, Claude, Ollama, Gemini (v1.5.3)
+
+### v1.6.0 (Completed)
+- [x] BM25 + RRF hybrid search — Korean+English tokenizer, lazy per-store index
+- [x] KnowledgeAtom 2-level indexing — chunk atoms with fragment URIs
+- [x] Stable URI scheme — `local://<store>/<quote(abs_path)>`, collision-free
+- [x] core.py mixin segmentation — 1258 → 221 lines, 3 focused modules
+- [x] Fix: `search_stream` double intent-refine bug
 
 ### v2.0.0 (Q3 2026)
 - [ ] Multi-language support (15+ languages) — multilingual stopwords + jieba
@@ -465,9 +473,10 @@ Use the links below to jump to the most relevant guide.
 | Topic | Description |
 |-------|-------------|
 | [Document Parsing](docs/wiki/Document_Parsing.md) | Supported formats, internal parsers, RAG chunking |
+| [Hybrid Search](docs/wiki/Hybrid_Search.md) | BM25+RRF, KnowledgeAtom indexing, stable URI scheme (v1.6.0) |
 | [Framework Integrations](docs/wiki/Framework_Integrations.md) | LangChain, LlamaIndex, Haystack, CrewAI adapters |
 | [API Reference](docs/wiki/API_Reference.md) | REST endpoints, payloads, rate limits |
-| [Architecture](docs/wiki/Architecture.md) | How all layers fit together (v1.5.2) |
+| [Architecture](docs/wiki/Architecture.md) | How all layers fit together (v1.6.0) |
 | [Configuration Reference](docs/wiki/Configuration.md) | Full list of environment variables and config fields |
 | [Production Deployment](docs/wiki/Production_Deployment.md) | Docker, systemd, reverse proxy, scaling tips |
 | [Troubleshooting](docs/wiki/Troubleshooting.md) | Step-by-step debugging playbook |
@@ -536,6 +545,6 @@ Built with amazing open source tools:
 
 Built with 🔥 by the Flamehaven Core Team
 
-*Last updated: April 19, 2026 • Version 1.5.3*
+*Last updated: April 19, 2026 • Version 1.6.1*
 
 </div>
@@ -4,6 +4,18 @@ This roadmap reflects the current constraints and priorities for Flamehaven
 FileSearch. Weekly usage budget is ~2%, so immediate focus is cost and quota
 pressure reduction before expanding surface area.
 
+## v1.6.0 (Released: 2026-04-19)
+
+**Focus:** Native RAG architecture — BM25+RRF hybrid search, chunk-level indexing.
+
+- [x] BM25 engine — Korean+English tokenizer, k1=1.5, b=0.75, lazy per-store rebuild.
+- [x] RRF fusion (k=60) — merges BM25 and ChronosGrid semantic lists by URI.
+- [x] KnowledgeAtom 2-level indexing — chunk atoms with `#cNNNN` fragment URIs.
+- [x] Stable URI scheme — `local://<store>/<quote(abs_path)>`, collision-free.
+- [x] core.py mixin segmentation — 1258 → 221 lines; 3 focused mixin modules.
+- [x] Fix: `search_stream` double intent-refine bug.
+- [x] 443 tests pass, 13 skipped; AI-Slop-Detector: CLEAN.
+
 ## Next Steps (Now)
 
 - [ ] Cache + cost improvements (cache hit tracking by search mode/backend,
 
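Review note on the RRF item in the roadmap: a minimal sketch of Reciprocal Rank Fusion keyed by URI strings, assuming 1-based ranks and the k=60 constant stated in the roadmap. The function name `rrf_fuse` and its signature are hypothetical; the actual code in `engine/hybrid_search.py` may differ.

```python
from collections import defaultdict
from typing import Dict, List


def rrf_fuse(ranked_lists: List[List[str]], k: int = 60, top_k: int = 5) -> List[str]:
    # Each input list holds document URIs ordered best-first.
    scores: Dict[str, float] = defaultdict(float)
    for ranked in ranked_lists:
        for rank, uri in enumerate(ranked, start=1):  # assumption: ranks start at 1
            scores[uri] += 1.0 / (k + rank)
    # Highest fused score first, truncated to top_k.
    return sorted(scores, key=lambda uri: scores[uri], reverse=True)[:top_k]


semantic = ["u1", "u2", "u3"]  # similarity-ranked list
bm25 = ["u2", "u4", "u1"]      # BM25-ranked list
print(rrf_fuse([semantic, bm25], top_k=3))
```

Because fusion uses only ranks and string keys, the two lists need no shared integer IDs and no score normalization, which matches the "by URI" design noted above.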
@@ -1,7 +1,7 @@
 # Architecture Overview
 
 Flamehaven FileSearch balances simplicity with production-grade safeguards. This
-document describes the moving parts as of **v1.5.2**, featuring:
+document describes the moving parts as of **v1.6.0**, featuring:
 - **Gravitas DSP Engine** (v1.3.1+)
 - **Multimodal Search** (v1.4.0+)
 - **pgvector with HNSW** (v1.4.0+)
@@ -10,6 +10,8 @@ document describes the moving parts as of **v1.5.2**, featuring:
 - **Universal Document Parser, Internal Chunker, Framework Integrations** (v1.5.0)
 - **Dead code removal, critical complexity fixes, 360-test suite** (v1.5.1)
 - **Parse Cache, ContextExtractor, Backend Plugin Architecture** (v1.5.2)
+- **Multi-provider LLM support** (v1.5.3)
+- **BM25+RRF Hybrid Search, KnowledgeAtom, Mixin Architecture** (v1.6.0)
 
 ---
 
@@ -42,13 +44,73 @@ Request → │ FastAPI Router│ ─────> │ Middleware  │ ──┐
 
 ---
 
-## 2. Core Search Engine (v1.4.1)
+## 2. Core Architecture (v1.6.0 — Mixin Pattern)
 
-`FlamehavenFileSearch` (in `core.py`) now supports three primary search modes:
+`FlamehavenFileSearch` is now a thin orchestrator composed of three focused mixins:
 
-- **Keyword Mode** – Traditional exact match indexing.
-- **Semantic Mode (OMEGA)** – Powered by the **Gravitas DSP Engine**. Uses Deterministic Semantic Projection (v2.0) to map text into a 384-dimensional space without heavy ML dependencies.
-- **Hybrid Mode** – Combines both keyword and semantic scores for maximum precision.
+```
+core.py (221 lines)
+  FlamehavenFileSearch(IngestMixin, LocalSearchMixin, CloudSearchMixin)
+    __init__ / create_store / list_stores / delete_store / get_metrics
+
+_ingest.py (228 lines) — IngestMixin
+  upload_file / upload_files / _local_upload / _generate_file_vector
+
+_search_local.py (273 lines) — LocalSearchMixin
+  _local_search / _run_hybrid_rerank / _rebuild_bm25
+  _get_doc_by_uri / _build_snippet / _build_rag_prompt / _provider_search
+
+_search_cloud.py (265 lines) — CloudSearchMixin
+  search / search_stream / search_multimodal
+  + shared helpers: _resolve_search_params / _ensure_store /
+    _query_vector_backend / _driftlock_validate /
+    _extract_grounding_sources / _gemini_search_call
+```
+
+`FlamehavenFileSearch` supports three primary search modes:
+
+- **Keyword Mode** – BM25-scored exact match indexing across all stored content.
+- **Semantic Mode (OMEGA)** – Powered by the **Gravitas DSP Engine**. Uses
+  Deterministic Semantic Projection (v2.0) to map text into a 384-dimensional
+  space without heavy ML dependencies.
+- **Hybrid Mode** – BM25 + ChronosGrid semantic merged via Reciprocal Rank
+  Fusion (RRF, k=60). See [Section 2a](#2a-bm25--rrf-hybrid-search) below.
+
+### 2a. BM25 + RRF Hybrid Search
+
+**Implementation:** `engine/hybrid_search.py`
+
+```
+BM25 (k1=1.5, b=0.75)
+  tokenizer: re.findall(r"[a-z0-9\uac00-\ud7a3]+", text.lower())
+  Korean Hangul syllable range: \uac00-\ud7a3
+  Lazy index per store — rebuilt only after uploads (_bm25_dirty flag)
+  Corpus: main docs + chunk atoms (full 2-level coverage)
+
+RRF(d) = sum_i 1 / (k + rank_i(d))   k=60; rank_i(d) = rank of d in result list i
+
+Fusion inputs:
+  List A: ChronosGrid semantic results  (similarity-ranked)
+  List B: BM25 scored results           (BM25 score-ranked)
+  ID key: stable URI string (collision-free across lists)
+
+Output: top-k docs resolved via _get_doc_by_uri()
+```
+
+### 2b. KnowledgeAtom 2-Level Indexing
+
+**Implementation:** `engine/knowledge_atom.py`
+
+Level 1 — File doc: `local://<store>/<quote(abs_path)>`
+Level 2 — Chunk atoms: `local://<store>/<quote(abs_path)>#c0001`
+
+`chunk_and_inject()`:
+- Splits content into 800-char windows with 120-char overlap
+- Skips chunks shorter than 80 chars (noise filter)
+- Embeds each chunk via `embedding_generator.generate()`
+- Injects into ChronosGrid for semantic retrieval
+- Registers in `_atom_store_docs[store_name][atom_uri]` for URI lookup
+- Both levels participate in BM25 corpus via `_rebuild_bm25()`
 
 ### Gravitas DSP Engine (v2.0)
 - **Zero-Dependency Vectorizer**: Replaced `sentence-transformers` with a lightweight, signed feature hashing algorithm.
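Review note on Section 2b: the windowing rules listed for `chunk_and_inject()` (800-char windows, 120-char overlap, chunks under 80 chars skipped) can be sketched as below. This is an illustration under stated assumptions, not the project's implementation: the function name `chunk_text` is hypothetical, and it splits on raw character offsets only, without the embedding or ChronosGrid injection steps.

```python
from typing import List


def chunk_text(text: str, size: int = 800, overlap: int = 120, min_len: int = 80) -> List[str]:
    # Each window starts (size - overlap) characters after the previous one,
    # so consecutive windows share `overlap` characters.
    step = size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + size]
        if len(chunk) >= min_len:  # noise filter: drop very short tail chunks
            chunks.append(chunk)
    return chunks


sample = "".join(chr(97 + i % 26) for i in range(2000))
chunks = chunk_text(sample)
print(len(chunks), [len(c) for c in chunks])
```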
@@ -84,9 +146,9 @@ The new **Chronos-Grid** integration handles high-speed vector storage and simil
 
 ---
 
-## 6. Testing & Quality (v1.4.2)
+## 6. Testing & Quality (v1.6.0)
 
-- **Test Framework**: `pytest` — 443 tests collected, all passing (360 + 83 new).
+- **Test Framework**: `pytest` — 443 tests pass, 13 skipped.
 - **Lint**: `black` (format) + `ruff` (lint/unused imports) — both enforced in CI.
 - **Validation**: `validators.py` enforces security policies (Filename 200-char max, FileSize, SearchQuery XSS/SQLi checks).
 - **SIDRCE Certification**: Omega 0.9894 (S++) — AI-Slop-Detector P0-P5 clean.
@@ -125,6 +187,8 @@ engine/
   parse_cache.py       — mtime-based parse result cache (v1.5.2)
   context_extractor.py — RAG chunk context window extractor (v1.5.2)
   text_chunker.py      — Structure-aware + token-aware RAG chunker (stdlib only)
+  hybrid_search.py     — BM25 + RRF fusion engine (v1.6.0)
+  knowledge_atom.py    — Chunk-level atom indexing with fragment URIs (v1.6.0)
   embedding_generator.py   — DSP v2.0 vectorizer
   chronos_grid.py      — Vector index + metadata store
   gravitas_pack.py     — Metadata compression
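Review note on `hybrid_search.py`: the tokenizer expression quoted in the changelog and in Section 2a can be exercised on its own. The wrapper name `tokenize` is hypothetical; the regular expression is copied from the diff. It keeps lowercase ASCII letters, digits, and Hangul syllables (U+AC00 to U+D7A3), and treats everything else as a separator.

```python
import re
from typing import List

# Pattern copied from the diff; the compiled constant name is an assumption.
TOKEN_RE = re.compile(r"[a-z0-9\uac00-\ud7a3]+")


def tokenize(text: str) -> List[str]:
    # Lowercase first so the [a-z] class also covers uppercase input.
    return TOKEN_RE.findall(text.lower())


print(tokenize("Hello, 세계! BM25 k1=1.5"))
```

One consequence worth noting for reviewers: characters outside these ranges, such as accented Latin letters or CJK ideographs, split tokens rather than joining them.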