multimodal model embedding fixes #759
base: main
Conversation
libinta commented Dec 23, 2025
- remove scatter_mm_placeholders and gather_mm_placeholders from hpu_model_runner, since upstream PR [Core][MM] Optimize encoder cache manager by operating with embeddings only (vllm#30475) made them unnecessary
- add HpuQwen3_VLForConditionalGeneration and override the qwen3_vl embedding-merge path so that _merge_multimodal_embeddings uses index_copy (see the sketch below)
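
A minimal sketch of the index_copy-based merge described above; the function name, signature, and tensor layout are illustrative assumptions, not the exact vllm-gaudi implementation:

```python
import torch


def merge_multimodal_embeddings_index_copy(
    inputs_embeds: torch.Tensor,          # (num_tokens, hidden) text embeddings
    multimodal_embeddings: torch.Tensor,  # (num_mm_tokens, hidden) image/video embeddings
    is_multimodal: torch.Tensor,          # (num_tokens,) bool mask of placeholder positions
) -> torch.Tensor:
    # Row indices of the multimodal placeholder tokens in the flattened batch.
    mm_positions = is_multimodal.nonzero(as_tuple=True)[0]
    # index_copy_ writes the multimodal rows in place, replacing the
    # boolean-masked assignment a generic merge helper would perform.
    inputs_embeds.index_copy_(
        0, mm_positions, multimodal_embeddings.to(inputs_embeds.dtype)
    )
    return inputs_embeds
```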
🚧 CI Blocked: The main CI workflow was not started for the following reason:
Format update for pre-commit error: vllm_gaudi/models/qwen3_vl.py:62:25: F821 Undefined name `_require_is_multimodal`
Please fix the basic test.
I believe qwen3_vl only shows up from transformers 4.57.0 onwards. The CI tests seem to be using …
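
As a hedged illustration of the version concern above, one way to gate Qwen3-VL-specific tests or registration on the installed transformers release; the helper and the pytest usage are assumptions, not existing project code:

```python
import pytest
import transformers
from packaging import version


def has_qwen3_vl_support() -> bool:
    # Qwen3-VL modeling code only ships with transformers >= 4.57.0, so
    # Qwen3-VL-specific code paths should be skipped on older releases.
    return version.parse(transformers.__version__) >= version.parse("4.57.0")


# Example: skip a whole test module when the CI image carries an older transformers.
pytestmark = pytest.mark.skipif(
    not has_qwen3_vl_support(),
    reason="Qwen3-VL requires transformers >= 4.57.0",
)
```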
Signed-off-by: Libin Tang <litang@habana.ai>