[Roadmap]: preparing for 1230 release #165

@hsliuustc0106

Motivation.

This live page describes the roadmap to the v0.12.0 release of vllm-omni, which accompanies vllm v0.12.0. Help-wanted items are marked with 🙋 in areas where the committer group is seeking more dedicated contributions.

Proposed Change.

CI/CD

P0: E2E test

P1: UT/ST for the following models

Model Support 🙋

P0:

P1:

Docs Refinement

P0:

Core 🙋

P0:

P1:

Disaggregation

P0:

Mode:

P0:

  • (EPD)G

P1:

  • E(PD)G
  • EPDG
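
The mode names above read as a compact grouping notation. As a rough illustration only, the sketch below parses such a string into groups, assuming that each letter names one pipeline stage (presumably encode, prefill, decode, and generation, though the issue does not spell this out) and that parentheses mark stages that stay co-located while everything else is disaggregated. The function name `parse_mode` is hypothetical and not part of vllm-omni.

```python
from __future__ import annotations


def parse_mode(mode: str) -> list[list[str]]:
    """Split a mode string such as "(EPD)G" into groups of stages.

    Assumption (not stated in the roadmap): stages inside parentheses
    share one deployment unit; every other letter is its own unit.
    """
    groups: list[list[str]] = []
    current: list[str] | None = None  # stages collected inside an open "("
    for ch in mode:
        if ch == "(":
            if current is not None:
                raise ValueError("nested parentheses are not supported")
            current = []
        elif ch == ")":
            if not current:
                raise ValueError("empty or unmatched group")
            groups.append(current)
            current = None
        elif ch.isalpha():
            if current is not None:
                current.append(ch)  # still inside a parenthesized group
            else:
                groups.append([ch])  # a stage standing on its own
        else:
            raise ValueError(f"unexpected character: {ch!r}")
    if current is not None:
        raise ValueError("unclosed parenthesis")
    return groups


if __name__ == "__main__":
    for mode in ("(EPD)G", "E(PD)G", "EPDG"):
        print(mode, "->", parse_mode(mode))
```

Under this reading, the P0 mode keeps three stages together and splits off only the last one, while the P1 modes split the pipeline further.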

Model adaptation:

  • Bagel
  • HunyuanImage-3.0
  • Qwen3-Omni & LongCat-Omni

Hardware:

P0:

  • Plugin platform abstraction for registering multiple hardware backends.

Benchmark 🙋

vLLM alignment and verification: 🙋

P0:

P1:

  • caching
  • parallelism
  • LoRA
  • multimodal input processing
  • PD/EPD disaggregation

Refactor 🙋

P0:

P1:

  • Simple, unified initialization and runtime argument settings for both offline and online inference. @tzhouam
  • Unified implementation of stage_worker across offline, async online, and multi-node inference. @Gaohan123

For diffusion support, please see the separate issue #85.

Feedback Period.

No response

CC List.

@Gaohan123 @ywang96 @Isotr0py @DarkLight1337 @david6666666 @ZJY0516

Any Other Things.

No response
