WA shared bias in UA #727

adobrzyn · 2025-12-16T16:27:14Z

No description provided.

Signed-off-by: Agata Dobrzyniewicz <[email protected]>

Copilot

Pull request overview

This PR implements a workaround to address out-of-memory issues related to huge shared bias in the unified bucketing strategy by limiting the maximum shared context to a single block size instead of calculating it based on model length and available blocks.

Temporarily caps max_shared_ctx to block_size to prevent OOM errors
Comments out the original calculation logic for future restoration
Adds a TODO comment indicating this is a temporary workaround

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

vllm_gaudi/extension/bucketing/unified.py

Signed-off-by: Agata Dobrzyniewicz <[email protected]>

github-actions · 2026-01-02T15:29:30Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Signed-off-by: Agata Dobrzyniewicz <[email protected]>

github-actions · 2026-01-02T16:56:30Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

Signed-off-by: Agata Dobrzyniewicz <[email protected]> Signed-off-by: Agata Dobrzyniewicz <[email protected]> Signed-off-by: Jin, Youzhi <[email protected]>

WA ua

40e4445

Signed-off-by: Agata Dobrzyniewicz <[email protected]>

Copilot AI review requested due to automatic review settings December 16, 2025 16:27

adobrzyn requested review from afierka-intel, iboiko-habana, kamil-kaczor, ksmusz, kzawora-intel, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners December 16, 2025 16:27

Copilot AI reviewed Dec 16, 2025

View reviewed changes

vllm_gaudi/extension/bucketing/unified.py Outdated Show resolved Hide resolved

github-actions bot mentioned this pull request Dec 16, 2025

🚦 Team Review Dashboard #701

Open

Less shared

23fd99c

Signed-off-by: Agata Dobrzyniewicz <[email protected]>

adobrzyn added 2 commits January 2, 2026 16:29

Merge branch 'main' into adobrzyn/ua_shared_wa

cf0104f

Update unified.py

161938d

Signed-off-by: Agata Dobrzyniewicz <[email protected]>

kzawora-intel approved these changes Jan 7, 2026

View reviewed changes

adobrzyn merged commit 19be0a2 into main Jan 8, 2026
50 checks passed

adobrzyn deleted the adobrzyn/ua_shared_wa branch January 9, 2026 10:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WA shared bias in UA #727

WA shared bias in UA #727

Uh oh!

adobrzyn commented Dec 16, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

github-actions bot commented Jan 2, 2026

Uh oh!

github-actions bot commented Jan 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WA shared bias in UA #727

WA shared bias in UA #727

Uh oh!

Conversation

adobrzyn commented Dec 16, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

github-actions bot commented Jan 2, 2026

🚧 CI Blocked

Uh oh!

github-actions bot commented Jan 2, 2026

✅ CI Passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants