Skip to content

Conversation

@dudilester
Copy link
Contributor

No description provided.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR introduces dynamic scaling for the V-Cache (value cache) on the hidden dimension, extending the existing dynamic scaling support beyond just the sequence length (T) dimension. The change modifies the value cache scaling mechanism to use a tuple of two scale tensors instead of a single tensor.

Key Changes:

  • Extended value cache scaling to support two dimensions: sequence length and hidden dimension
  • Modified value_scales from a single tensor to a tuple of two tensors (value_scales_on_T, value_scales_on_hidden)
  • Updated all related cache operations and type signatures to handle the new tuple structure

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
vllm_gaudi/v1/worker/hpu_model_runner.py Initializes two separate scale tensors for V-Cache and combines them into a tuple
vllm_gaudi/extension/ops.py Updates unflatten operation to handle tuple structure for v_scales
vllm_gaudi/extension/cache_ops.py Modifies copy_blocks to access first element of v_scales tuple
vllm_gaudi/attention/ops/hpu_paged_attn.py Updates type hints to reflect tuple structure for value scales
vllm_gaudi/attention/backends/hpu_attn.py Updates kv_cache type signature to reflect new tuple structure

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch 4 times, most recently from a38f37c to 348722d Compare December 30, 2025 06:39
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 316fef7 to 500c8ba Compare December 30, 2025 06:55
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

Signed-off-by: Dudi Lester <[email protected]>
@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from b695d45 to 86ee21a Compare January 1, 2026 08:54
@github-actions
Copy link

github-actions bot commented Jan 1, 2026

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

github-actions bot commented Jan 1, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

1 similar comment
@github-actions
Copy link

github-actions bot commented Jan 5, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch 2 times, most recently from e16b4c6 to 4d88cd6 Compare January 6, 2026 08:30
@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from d765839 to b3aaad1 Compare January 6, 2026 09:02
@github-actions
Copy link

github-actions bot commented Jan 6, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

1 similar comment
@github-actions
Copy link

github-actions bot commented Jan 7, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 2f61c6e to 686dba2 Compare January 12, 2026 15:41
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
aa125ecf0edb9cd67656553d11d643aeb444ff9e

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 88e44f5 to b02f2cc Compare January 15, 2026 15:08
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
4c1c501a7ee1d5efbad945ea62a702ce5cefb799

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants