[Core] Supports stage abstraction in the diffusion model #391

fake0fan · 2025-12-20T13:33:02Z

To align with our initial vision and for future overall optimization, we gradually began to provide Stage abstractions for Diffusion.

[Core] Add Stage Abstraction Support for Diffusion Models

Overview

This PR adds stage abstraction support for the diffusion model component of vLLM-Omni, achieving a consistent architectural design with LLM models. It also includes code refactoring to unify the sampling parameter interface, improving code maintainability and extensibility.

Major Changes

1. Code Refactoring

Refactored entry point code structure: Refactored and integrated LLM-related code from omni_llm.py into omni.py, unifying entry point management
Enhanced Stage abstraction: Extended omni_stage.py to support stage configuration and management for diffusion models

2. Diffusion Stage Abstraction Support

New unified sampling parameter class (omni_sampling_params.py):
- Created OmniSamplingParams class to uniformly manage sampling parameters for both LLM and diffusion models
- Supports LLM parameters (temperature, top_p, top_k, etc.) and diffusion parameters (num_inference_steps, guidance_scale, etc.)
- Provides conversion methods with vLLM SamplingParams
Extended Diffusion Engine:
- Updated diffusion_engine.py to support stage abstraction
- Enhanced stage support in gpu_worker.py
Updated configuration system:
- Added configuration file: QwenImagePipeline.yaml
- Updated stage configuration files for multiple models

3. Example Updates

Updated text_to_image.py example to demonstrate how to use the new stage abstraction interface

4. Outputs

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2025-12-20T13:38:51Z

vllm_omni/entrypoints/omni.py

        assert model != "", "Null model id detected, please specify a model id."
        model = omni_snapshot_download(model)
        if args:
            args[0] = model


Avoid assigning to immutable args tuple in Omni init

When Omni is instantiated with the model passed positionally (e.g. Omni("Qwen/Qwen-Image")), the constructor assigns to args[0], but args is a tuple, so the assignment raises TypeError: 'tuple' object does not support item assignment before any initialization occurs. This makes the new entrypoint unusable for positional calls that previously worked with OmniLLM; callers must now pass model as a keyword or hit a hard crash.

Useful? React with 👍 / 👎.

ZJY0516 · 2025-12-20T14:52:02Z

looking forward to it

hsliuustc0106 · 2025-12-20T15:29:03Z

let's get the initial version done before 1230 release

ZJY0516 · 2025-12-22T03:25:53Z

Does this pr support reuse vllm as text encoding stage for diffusion models?

fake0fan · 2025-12-22T03:31:12Z

Does this pr support reuse vllm as text encoding stage for diffusion models?

Not yet.

This PR only encapsulates the entire diffusion model into a single stage first.

ZJY0516 · 2025-12-22T03:33:56Z

vllm_omni/model_executor/stage_configs/QwenImagePipeline.yaml

@@ -0,0 +1,36 @@
+# stage config for running Qwen-Image with diffusion stage type.


I have concerns about adding this for diffusion models, as I believe it introduces a significant UX drawback.

princepride · 2025-12-22T05:57:36Z

Does this PR mean that all models under diffusion folder can be deployed using YAML?

fake0fan · 2025-12-23T02:03:39Z

Does this PR mean that all models under diffusion folder can be deployed using YAML?

Through some offline discussions, we decided that this version will not require providing a yaml file for the Diffusion model. Instead, the system will automatically generate a YAML file for the current Diffusion model.

princepride · 2025-12-23T02:10:07Z

In which group you are discussing? Can you add me?

erfgss · 2025-12-23T02:55:57Z

Fixes #340

Signed-off-by: Chenguang ZHENG <[email protected]>

examples/online_serving/text_to_image/gradio_demo.py

Signed-off-by: Chenguang ZHENG <[email protected]>

fake0fan requested a review from hsliuustc0106 as a code owner December 20, 2025 13:33

chatgpt-codex-connector bot reviewed Dec 20, 2025

View reviewed changes

ZJY0516 self-requested a review December 20, 2025 15:30

hsliuustc0106 mentioned this pull request Dec 21, 2025

[New Model]Bagel model(Diffusion Only) #319

Open

5 tasks

ZJY0516 reviewed Dec 22, 2025

View reviewed changes

Bounty-hunter mentioned this pull request Dec 23, 2025

[Bug]: /v1/model endpoint fails #414

Open

1 task

fake0fan force-pushed the test_stage branch from 8e44488 to 7f9aecc Compare December 23, 2025 15:25

fake0fan added 2 commits December 23, 2025 15:50

final version

4923fd0

Signed-off-by: Chenguang ZHENG <[email protected]>

support image/generation

d5be79d

Signed-off-by: Chenguang ZHENG <[email protected]>

fake0fan force-pushed the test_stage branch from 7f9aecc to d5be79d Compare December 23, 2025 16:07

fake0fan changed the title ~~[WIP][Core] Supports stage abstraction in the diffusion model~~ [Core] Supports stage abstraction in the diffusion model Dec 23, 2025

SamitHuang reviewed Dec 24, 2025

View reviewed changes

examples/online_serving/text_to_image/gradio_demo.py Outdated Show resolved Hide resolved

examples/online_serving/text_to_image/gradio_demo.py Outdated Show resolved Hide resolved

pass simple tests

94cd830

Signed-off-by: Chenguang ZHENG <[email protected]>

david6666666 added the ready label to trigger buildkite CI label Dec 24, 2025

hsliuustc0106 mentioned this pull request Dec 24, 2025

[Roadmap]: preparing for 1230 release #165

Open

59 tasks

		@@ -0,0 +1,36 @@
		# stage config for running Qwen-Image with diffusion stage type.

[Core] Supports stage abstraction in the diffusion model #391

Are you sure you want to change the base?

[Core] Supports stage abstraction in the diffusion model #391

Uh oh!

Conversation

fake0fan commented Dec 20, 2025

[Core] Add Stage Abstraction Support for Diffusion Models

Overview

Major Changes

1. Code Refactoring

2. Diffusion Stage Abstraction Support

3. Example Updates

4. Outputs

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Dec 20, 2025

Choose a reason for hiding this comment

Uh oh!

ZJY0516 commented Dec 20, 2025

Uh oh!

hsliuustc0106 commented Dec 20, 2025

Uh oh!

ZJY0516 commented Dec 22, 2025

Uh oh!

fake0fan commented Dec 22, 2025

Uh oh!

ZJY0516 Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

david6666666 Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

princepride commented Dec 22, 2025

Uh oh!

fake0fan commented Dec 23, 2025

Uh oh!

princepride commented Dec 23, 2025

Uh oh!

erfgss commented Dec 23, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants