Add dual context percentage fields to working memory endpoints #38

abrookins · 2025-07-22T21:09:51Z

Summary

Add dual context percentage fields to working memory endpoints to provide comprehensive visibility into context window usage and auto-summarization triggers.

Changes

New Fields Added

context_percentage_total_used: Shows actual percentage of total context window currently used (0-100%)
context_percentage_until_summarization: Shows progress toward the auto-summarization threshold (0-100%; reaches 100% when the threshold is hit)

Implementation Details

Updated API calculation function _calculate_context_usage_percentages() to return both values as a tuple
Modified both GET /v1/working-memory/{session_id} and PUT /v1/working-memory/{session_id} endpoints
Updated server models (WorkingMemoryResponse) with new fields
Updated SDK client models to match server changes
Added comprehensive test coverage for both fields
Maintains configurable summarization threshold (default 70% via SUMMARIZATION_THRESHOLD)
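
The relationship between the two fields can be sketched roughly as follows. This is an illustration only, not the PR's actual _calculate_context_usage_percentages(): the function name context_percentages, its parameters, and the capping at 100 are assumptions, built on the formula from the original commit message, (current_tokens / token_threshold) * 100 with token_threshold = context_window * threshold.

```python
from typing import Optional, Tuple


def context_percentages(
    current_tokens: int,
    context_window: Optional[int],
    summarization_threshold: float = 0.7,
) -> Tuple[Optional[float], Optional[float]]:
    """Return (total_used, until_summarization), each 0-100 or None.

    Hypothetical helper for illustration; names and capping are assumptions.
    """
    # Without a usable context window or threshold there is nothing to measure.
    if not context_window or context_window <= 0 or summarization_threshold <= 0:
        return None, None

    # Share of the whole context window that is currently occupied.
    total_used = min(current_tokens * 100.0 / context_window, 100.0)

    # Progress toward the token count at which summarization would trigger.
    token_threshold = context_window * summarization_threshold
    until_summarization = min(current_tokens * 100.0 / token_threshold, 100.0)

    return total_used, until_summarization
```

Under these assumptions, 4,550 tokens in a 10,000-token window with the default 0.7 threshold gives approximately 45.5 and 65.0, which is consistent with the values in the example response.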

Backward Compatibility

Replaces the previous single context_usage_percentage field with the two new fields, so clients that read the old field need to switch to them
All other existing functionality is preserved, with enhanced context visibility

Benefits

Users now receive complete context information:

Total Usage: See exactly how much of the context window is currently used
Summarization Proximity: Know how close the session is to the automatic summarization trigger
Better Planning: Make informed decisions about message length and conversation flow

Testing

✅ All unit tests passing (Python 3.10, 3.11, 3.12)
✅ All integration tests passing (Redis 8.0.3, latest, redis-stack)
✅ Comprehensive SDK client test coverage (35/35 tests)
✅ Linting and formatting checks passed

Example Response

{
  "session_id": "example-session",
  "messages": [...],
  "context_percentage_total_used": 45.5,
  "context_percentage_until_summarization": 65.0,
  ...
}
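
As a rough sketch of how a client might act on these fields: the helper name summarization_status, the 90% warning cutoff, and the handling of missing values are all assumptions for illustration, not part of the SDK.

```python
from typing import Any, Dict


def summarization_status(response: Dict[str, Any], warn_at: float = 90.0) -> str:
    """Classify a working-memory response by its summarization proximity.

    Hypothetical helper; `warn_at` is an arbitrary illustrative cutoff.
    """
    until = response.get("context_percentage_until_summarization")
    # Assumption: the field can be absent or null (for example, if the
    # server could not determine a context window for the request).
    if until is None:
        return "unknown"
    if until >= 100.0:
        return "at-threshold"
    if until >= warn_at:
        return "near-threshold"
    return "ok"


example = {
    "session_id": "example-session",
    "context_percentage_total_used": 45.5,
    "context_percentage_until_summarization": 65.0,
}
print(summarization_status(example))  # prints "ok"
```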

Resolves #37

🤖 Generated with Claude Code

- Add context_usage_percentage field to WorkingMemoryResponse model
- Add _calculate_context_usage_percentage() helper function
- Update GET /v1/working-memory/{session_id} to return percentage
- Update PUT /v1/working-memory/{session_id} to return percentage based on final state (after potential summarization)
- Percentage calculated as (current_tokens / token_threshold) * 100, where token_threshold = context_window * 0.7
- Returns None when no model info provided, otherwise 0-100% value
Resolves #37
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-authored-by: Andrew Brookins <[email protected]>

Copilot

Pull Request Overview

This PR adds context usage percentage tracking to working memory endpoints to help monitor how much of the context window is being utilized before auto-summarization is triggered. The change provides visibility into memory usage patterns and helps understand when summarization occurs.

Key Changes:

Added context_usage_percentage field to WorkingMemoryResponse model
Implemented calculation logic to determine percentage of context window used
Updated GET and PUT working memory endpoints to include the percentage in responses

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File	Description
agent_memory_server/models.py	Added `context_usage_percentage` field to `WorkingMemoryResponse`
agent_memory_server/api.py	Implemented context usage calculation and updated endpoints to return percentage
tests/test_full_integration.py	Code formatting improvements for assert statements

Resolve TypeError by properly handling the context_usage_percentage field in WorkingMemoryResponse creation to avoid duplicate keyword arguments.
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

- Add context_usage_percentage field to WorkingMemoryResponse model
- Add comprehensive test suite for the new field covering:
  - Field creation and default values
  - Serialization behavior
  - Validation of different percentage values
  - Dictionary-to-model conversion
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Address review comments by making the 0.7 threshold configurable instead of hardcoded. Added summarization_threshold setting that can be configured via environment variable or config file.
- Added summarization_threshold to Settings (default: 0.7)
- Updated both _calculate_context_usage_percentage and _summarize_working_memory to use settings.summarization_threshold
- Improved maintainability and consistency between functions
- Allows users to customize when summarization is triggered
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

- Add context_percentage_total_used field showing actual context window usage (0-100%)
- Add context_percentage_until_summarization field showing percentage until auto-summarization triggers (0-100%)
- Update API calculation function to return both values as tuple
- Update server and SDK models with new fields
- Update comprehensive test coverage for both fields
- Remove old single context_usage_percentage field
- Maintain configurable summarization threshold (default 70%)
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Copilot AI review requested due to automatic review settings July 22, 2025 21:09

Copilot AI reviewed Jul 22, 2025

abrookins and others added 5 commits July 24, 2025 16:33

Fix code formatting in test_full_integration.py

a1a778a

🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

abrookins changed the title ~~Add context usage percentage to working memory endpoints~~ Add dual context percentage fields to working memory endpoints Jul 25, 2025

abrookins merged commit abb0fff into main Jul 26, 2025
10 checks passed

abrookins deleted the claude/issue-37-20250722-2011 branch July 26, 2025 00:52
