Skip to content

Conversation

@kamil-kaczor
Copy link
Collaborator

No description provided.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR optimizes defragmentation operations by conditionally performing cache swaps only when prefix caching (APC) is enabled. When APC is disabled, the code skips unnecessary bidirectional swap operations that were previously always executed.

Key changes:

  • Added APC configuration detection to determine if prefix caching is enabled
  • Made the swap of free blocks back to used blocks conditional on APC being enabled
  • Added clarifying comments to distinguish between used and free block indices

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@github-actions
Copy link

github-actions bot commented Dec 8, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
da7bc54ea8f44a2dcacc4a9869721bd105006e10

@github-actions
Copy link

github-actions bot commented Dec 8, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
da7bc54ea8f44a2dcacc4a9869721bd105006e10

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant