Add hindsight-llamaindex package providing persistent memory tools for LlamaIndex agents via the native BaseToolSpec pattern. Includes retain, recall, and reflect tools, a convenience factory, global config, a full test suite, a docs page, a blog post, and an integrations.json entry.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Fix ReActAgent API: `from_tools()` → constructor, `chat()` → `await run()`
- Add create_bank step to all quickstart examples
- Add production patterns section to docs (tags, error handling, bank lifecycle)
- Add memory scoping recommendation to README
- Add when-not-to-use section to blog post
- Add LlamaIndex compatibility tests (agent acceptance, `FunctionTool.call`)
- Fix self-hosted auth wording in cookbook notebook

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Use `await client.acreate_bank()` instead of sync `create_bank()` to avoid "event loop already running" errors in notebooks and async contexts
- Wrap plain Python examples in `async def main()` + `asyncio.run(main())` so they are copy-paste runnable as scripts
- Add Jupyter notebook tip to docs showing the top-level await pattern
- Bank lifecycle example in docs now uses async `acreate_bank`

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
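The `async def main()` + `asyncio.run(main())` pattern the commit describes can be sketched as follows. `FakeHindsightClient` and its `acreate_bank` method are stand-ins for the real SDK client, not its actual API:

```python
import asyncio


class FakeHindsightClient:
    # Stand-in for the real Hindsight client (hypothetical name).
    # acreate_bank mirrors the async bank-creation call described above.
    async def acreate_bank(self, name: str) -> dict:
        # The real call would hit the Hindsight API; here we just echo.
        return {"bank": name, "created": True}


async def main() -> None:
    client = FakeHindsightClient()
    bank = await client.acreate_bank("demo")
    print(bank)


if __name__ == "__main__":
    # Runnable as a plain script; in a Jupyter notebook, use top-level
    # `await main()` instead, since the kernel already runs an event loop.
    asyncio.run(main())
```

Calling `asyncio.run()` from inside a running loop raises `RuntimeError`, which is why the notebook tip recommends top-level `await`.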
HindsightToolSpec now provides both sync and async tool implementations
using LlamaIndex's (sync_fn, async_fn) tuple pattern in spec_functions.
Async agents (ReActAgent, etc.) use aretain/arecall/areflect natively,
avoiding the "Timeout context manager should be used inside a task"
error that occurred when sync _run_async() was called from within an
active event loop.
- Add aretain_memory, arecall_memory, areflect_on_memory async methods
- Extract shared kwargs builders (_retain_kwargs, _recall_kwargs, etc.)
- spec_functions now uses tuples: [("retain_memory", "aretain_memory"), ...]
- Tests verify tools have both sync fn and async fn set
- Notebook verified end-to-end with nbclient against local Hindsight
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
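The `(sync_fn, async_fn)` tuple pattern can be illustrated with a pure-Python sketch. This deliberately omits the `BaseToolSpec` base class and the real client calls, so the method bodies and `_retain_kwargs`-style helpers here are placeholders:

```python
import asyncio


class HindsightToolSpecSketch:
    # Sketch only: the real class subclasses LlamaIndex's BaseToolSpec.
    # Each entry pairs a sync tool name with its async counterpart, so
    # async agents invoke the a*() coroutines natively instead of
    # bridging event loops through a sync wrapper.
    spec_functions = [
        ("retain_memory", "aretain_memory"),
        ("recall_memory", "arecall_memory"),
    ]

    async def aretain_memory(self, content: str) -> str:
        # Real version would await the Hindsight client here.
        return f"retained: {content}"

    def retain_memory(self, content: str) -> str:
        # Sync wrapper; only safe when no event loop is already running.
        return asyncio.run(self.aretain_memory(content))

    async def arecall_memory(self, query: str) -> list[str]:
        return [f"memory matching {query!r}"]

    def recall_memory(self, query: str) -> list[str]:
        return asyncio.run(self.arecall_memory(query))
```

With both variants registered, an async agent never calls the sync wrapper, which is what avoids the "Timeout context manager" error described above.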
The blog post will be pulled in separately from its own PR. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
nicoloboschi
left a comment
Review
Overall the PR is well-structured — clean separation of concerns (tools.py, config.py, _client.py, errors.py), comprehensive tests (34 unit tests), and good docs. A few things worth discussing:
Package structure doesn't follow LlamaIndex conventions
The PR uses hindsight_llamaindex/ as a standalone package. The standard LlamaIndex community integration uses namespace packages:
```
# PR has:
hindsight-integrations/llamaindex/hindsight_llamaindex/

# LlamaIndex convention:
llama-index-tools-hindsight/llama_index/tools/hindsight/base.py
```
With LlamaIndex, `llama_index/` and `llama_index/tools/` must NOT have `__init__.py` (PEP 420 implicit namespace packages). The main class goes in `base.py`, not `tools.py`.
If the package is ever submitted to LlamaHub, or users expect `from llama_index.tools.hindsight import HindsightToolSpec`, it won't work. Fine as a standalone package, but it breaks the LlamaIndex ecosystem path.
Suggestion: Either restructure now, or document explicitly that this is standalone (not LlamaHub) and plan migration later.
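If restructuring, the packaging config would need to ship the namespace directories without `__init__.py` files. A hypothetical sketch assuming Poetry (the tool LlamaIndex integrations commonly use — verify against current LlamaHub templates before copying):

```toml
[tool.poetry]
name = "llama-index-tools-hindsight"
version = "0.1.0"
description = "Hindsight memory tools for LlamaIndex agents"
# Include the llama_index/ tree as-is; with no __init__.py at the
# llama_index/ or llama_index/tools/ levels, PEP 420 namespace
# resolution merges it with the core llama-index package.
packages = [{ include = "llama_index/" }]

[tool.poetry.dependencies]
python = ">=3.9"
llama-index-core = ">=0.10.0"
```

The key point is `packages = [{ include = "llama_index/" }]` combined with the absence of `__init__.py` in the namespace directories.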
Missing BaseMemory implementation
The PR only implements BaseToolSpec (agent-driven retain/recall/reflect). LlamaIndex also has BaseMemory which provides automatic memory — get() enriches prompts with recalled memories transparently, put() auto-retains messages.
The Mem0 integration (llama-index-memory-mem0) does both: BaseMemory for automatic recall/retain + tools for explicit agent control. This would align better with how Claude Code 0.3.x works — recall injected automatically into UserPromptSubmit hooks, retain on Stop events.
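The automatic-memory behavior being requested can be sketched without the real `BaseMemory` interface (whose exact method signatures vary by LlamaIndex version, so this is a pure-Python analogy, not the actual API):

```python
from typing import Callable


class AutoMemorySketch:
    # Sketch of automatic memory: get() transparently enriches the
    # prompt with recalled facts, put() auto-retains each message.
    # The real implementation would subclass llama_index's BaseMemory.
    def __init__(
        self,
        recall_fn: Callable[[str], list[str]],  # e.g. wraps Hindsight recall
        retain_fn: Callable[[str], None],       # e.g. wraps Hindsight retain
    ) -> None:
        self._recall = recall_fn
        self._retain = retain_fn

    def get(self, query: str) -> list[str]:
        # Called before the LLM turn: inject recalled memories
        # without the agent having to invoke a tool.
        return self._recall(query)

    def put(self, message: str) -> None:
        # Called after each message: persist it automatically.
        self._retain(message)
```

This mirrors the Claude Code 0.3.x split the reviewer describes: automatic recall/retain via hooks, plus tools for explicit agent control.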
Consistency gaps with other Hindsight integrations
| Feature | Claude Code 0.3.x | CrewAI | This PR |
|---|---|---|---|
| Auto-recall (inject into prompt) | ✅ (hook) | ✅ (Storage.search) | ❌ (tool only) |
| Auto-retain (on conversation end) | ✅ (hook) | ✅ (Storage.save) | ❌ (tool only) |
| Bank mission setup | ✅ | ✅ | ❌ |
| `document_id` generation | ✅ (session+timestamp) | N/A | Static only |
| `async: true` retain | ✅ | ✅ | ❌ |
| `context` source label | ✅ ("claude-code") | ✅ | ❌ |
| Error handling | Graceful (logs, continues) | Graceful | Raises HindsightError |
Specific items:
- No `document_id` auto-generation — Claude Code generates `{session_id}-{timestamp}` for upsert/grouping. This PR only supports a static `retain_document_id`.
- No bank mission management — other integrations call `set_bank_mission()` on first use so the memory engine has context for fact extraction.
- No `context` param on retain — Claude Code passes `"claude-code"` as the source label.
- Retain doesn't use `async: true` — other integrations use async retain for non-blocking storage. Valid for tool-based usage (the agent expects confirmation) but worth noting.
Minor items
- `config.py` — default URL is production (https://api.hindsight.vectorize.io) while CrewAI defaults to `http://localhost:8888`. Should be consistent across integrations.
- `_client.py` — hardcoded `timeout: 30.0`. Claude Code uses different timeouts per operation (retain: 15s, recall: 10s, health: 5s).
- Blog post referenced in PR description but not in the diff.
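One way to address the timeout item is a small per-operation config object instead of a single constant; this is a hypothetical sketch, not code from any of the integrations:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperationTimeouts:
    # Hypothetical config mirroring Claude Code's per-operation values,
    # replacing _client.py's single hardcoded 30.0s timeout.
    retain: float = 15.0
    recall: float = 10.0
    health: float = 5.0
```

The client would then pass `timeouts.retain`, `timeouts.recall`, etc. to each HTTP call rather than one shared value.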
Summary
- `hindsight-llamaindex` package: `BaseToolSpec` subclass + `create_hindsight_tools()` factory giving LlamaIndex agents persistent memory via retain/recall/reflect
- `hindsight-docs/docs/sdks/integrations/llamaindex.md`
- `hindsight-docs/blog/2026-03-23-llamaindex-memory.md`
- `integrations.json` entry

Test plan
- `uv run pytest tests/ -v` — 34 passed, 3 skipped (manual)
- `uv run ruff check . && uv run ruff format --check .` — clean
- `test_manual.py`

🤖 Generated with Claude Code