[MOE] Fix issues with batch, use postponed_const when fusing expert weights #32546
Conversation
    if: ${{ inputs.model_scope == 'precommit' }}
    run: |
      export PYTHONPATH=${MODEL_HUB_TESTS_INSTALL_DIR}:$PYTHONPATH
      python3 -m pip install optimum-intel==1.25.2 transformers==4.53.3 --upgrade
It is bad practice to introduce test dependencies directly in the pipeline; why can't we use the common test requirements?
Upgrading transformers and optimum-intel is required to pick up versions that support qwen3_moe models.
Ideally it would be best to update tests/requirements_pytorch to align the versions, but that seems risky without thorough testing, since those requirements are used by a large number of model tests, including the nightly large-model runs.
I initially tried running the other precommit suites, test_pa_transformation and test_transformations, and they appear to have passed with the upgraded versions.
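For context, a minimal sketch of how that requirement could be sanity-checked before the MOE tests run (not part of this PR; it assumes the supported transformers versions register qwen3_moe with AutoConfig, which is how recent releases expose new architectures):

```python
# Hedged sketch: verify the installed transformers build knows the qwen3_moe
# architecture before running the MOE precommit tests. The exact minimum
# version is an assumption; only public transformers APIs are used.
from transformers import AutoConfig

try:
    # AutoConfig.for_model() raises ValueError for unknown model types,
    # so this fails fast on builds without qwen3_moe support.
    cfg = AutoConfig.for_model("qwen3_moe")
    print(f"qwen3_moe supported, config class: {type(cfg).__name__}")
except ValueError:
    raise SystemExit(
        "installed transformers does not support qwen3_moe; "
        "upgrade as pinned in the workflow step above"
    )
```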
Details:
Tickets: