Skip to content

v1.3.1

Choose a tag to compare

@flamehaven01 flamehaven01 released this 16 Dec 12:57
· 66 commits to main since this release
abbf376

Added - Phase 2 & 3: Semantic Search + Gravitas-Pack Integration

  • Gravitas Vectorizer v2.0: Custom Deterministic Semantic Projection (DSP) algorithm
    • Zero ML dependencies (removed sentence-transformers 500MB+)
    • Instant initialization (<1ms vs 2min+ before)
    • Hybrid feature extraction: word tokens (2.0x weight) + char n-grams (3-5)
    • Signed feature hashing for collision mitigation
    • 384-dimensional unit-normalized vectors
    • LRU caching with 16.7%+ hit rate
  • GravitasPacker: Symbolic compression integrated into cache layer
    • 90%+ compression ratio on metadata
    • Deterministic lore scroll generation
    • Instant decompression (<1ms)
  • Vector Quantizer: int8 quantization for 75% memory reduction
    • Asymmetric quantization (per-vector min/max calibration)
    • 30%+ speedup on cosine similarity calculations
    • Backward compatible with float32 vectors
  • Search Modes: keyword, semantic, hybrid via search_mode parameter
  • API Schema Enhancement: Added refined_query, corrections, search_mode, search_intent, semantic_results to SearchResponse
  • Chronos-Grid Integration: Vector storage and similarity search
  • Intent-Refiner: Typo correction and query optimization
  • unittest Migration: Replaced pytest with Python stdlib unittest
    • 19/19 tests passing in 0.33s
    • Zero timeout issues
    • Master test suite: tests/run_all_tests.py
  • Performance Benchmarks: Dedicated benchmark suite for DSP v2.0
  • Documentation: README, CHANGELOG, TOMB files updated

Performance

  • Vector generation: <1ms per text
  • Memory: 1536 bytes → 384 bytes per vector (75% reduction)
  • Metadata compression: 90%+ ratio
  • Search speed: 30% faster on quantized vectors
  • Similar text similarity: 0.787 (78.7%)
  • Differentiation ratio: 2.36x
  • Cache efficiency: 16.7%+ hit rate
  • Search 1000 vectors: <100ms
  • Precision loss: <0.1% (negligible for file search)

Changed

  • EmbeddingGenerator completely rewritten with DSP algorithm
  • cache.py: Integrated GravitasPacker compression/decompression
  • chronos_grid.py: Optional quantization support
  • Test infrastructure migrated from pytest to unittest
  • Docker metadata updated to v1.3.1
  • README badges and features updated
  • Version bumped to 1.3.1

Fixed

  • Critical: pytest timeout issue resolved via unittest migration
  • Critical: sentence-transformers blocking eliminated
  • ASCII safety enforced across all output (no Unicode in cp949 environment)

Tests

  • python tests/run_all_tests.py (19/19 passed, 0.33s)
  • All semantic search features verified
  • Performance benchmarks included

Breaking Changes

  • None (fully backward compatible)
  • search_mode parameter is optional with default behavior preserved