Commit d2adf20

update doc
Signed-off-by: David Chen <530634352@qq.com>
1 parent 6cc4d2b commit d2adf20

4 files changed, 38 additions and 2 deletions

docs/user_guide/acceleration/cache_dit_acceleration.md

Lines changed: 21 additions & 0 deletions
@@ -50,6 +50,27 @@ omni = Omni(
 )
 ```
 
+## Online Serving (OpenAI-Compatible)
+
+Enable Cache-DiT for online serving by passing `--cache-backend cache_dit` when starting the server:
+
+```bash
+# Use Cache-DiT default (recommended) parameters
+vllm serve Qwen/Qwen-Image --omni --port 8091 --cache-backend cache_dit
+```
+
+To customize Cache-DiT settings for online serving, pass a JSON string via `--cache-config`:
+
+```bash
+vllm serve Qwen/Qwen-Image --omni --port 8091 \
+  --cache-backend cache_dit \
+  --cache-config '{"Fn_compute_blocks": 1, "Bn_compute_blocks": 0, "max_warmup_steps": 4, "residual_diff_threshold": 0.12}'
+```
+
+For complete, runnable scripts (including base64 image extraction), see:
+
+- `docs/user_guide/examples/online_serving/text_to_image.md`
+- `docs/user_guide/examples/online_serving/image_to_image.md`
 
 ## Acceleration Methods
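Hand-writing the JSON string for `--cache-config` is easy to get wrong (single quotes inside the JSON, Python-style `True`, shell quoting). The following is a minimal sketch, not part of this commit, that assembles the same command line from a Python dict using only the standard library. The helper name `build_serve_command` is hypothetical; the flags and parameter names are the ones shown in the hunk above.

```python
import json
import shlex


def build_serve_command(model, cache_backend, cache_config=None, port=8091):
    """Return a shell-safe `vllm serve` command line as a single string.

    Hypothetical helper for illustration; it only formats the flags
    shown in the documentation and does not start a server.
    """
    args = [
        "vllm", "serve", model,
        "--omni",
        "--port", str(port),
        "--cache-backend", cache_backend,
    ]
    if cache_config is not None:
        # json.dumps always emits valid JSON (double quotes, true/false).
        args += ["--cache-config", json.dumps(cache_config)]
    # shlex.join quotes each argument for a POSIX shell.
    return shlex.join(args)


cache_dit_cmd = build_serve_command(
    "Qwen/Qwen-Image",
    "cache_dit",
    {
        "Fn_compute_blocks": 1,
        "Bn_compute_blocks": 0,
        "max_warmup_steps": 4,
        "residual_diff_threshold": 0.12,
    },
)
print(cache_dit_cmd)
```

The same helper works for any backend name, for example `build_serve_command("Qwen/Qwen-Image", "tea_cache", {"rel_l1_thresh": 0.2})`. Omitting `cache_config` produces the default-parameter command.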

docs/user_guide/acceleration/teacache.md

Lines changed: 15 additions & 0 deletions
@@ -39,6 +39,21 @@ omni = Omni(
 )
 ```
 
+## Online Serving (OpenAI-Compatible)
+
+Enable TeaCache for online serving by passing `--cache-backend tea_cache` when starting the server:
+
+```bash
+vllm serve Qwen/Qwen-Image --omni --port 8091 \
+  --cache-backend tea_cache \
+  --cache-config '{"rel_l1_thresh": 0.2}'
+```
+
+For complete, runnable scripts (including base64 image extraction), see:
+
+- `docs/user_guide/examples/online_serving/text_to_image.md`
+- `docs/user_guide/examples/online_serving/image_to_image.md`
+
 ## Configuration Parameters
 
 ### `rel_l1_thresh` (float, default: `0.2`)
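The added sections mention base64 image extraction but defer the details to the linked example pages. As a rough sketch of that step only, the snippet below decodes a base64 image string into raw bytes. It is not taken from this commit: the helper name `decode_image_payload` is hypothetical, and where the string sits in the server response is deliberately left out because this diff does not show it.

```python
import base64
import binascii


def decode_image_payload(payload: str) -> bytes:
    """Decode a base64 image string, with or without a data-URL prefix.

    Hypothetical helper for illustration; the response field that
    carries the payload is an open assumption and is not modeled here.
    """
    # Drop an optional "data:image/png;base64," style prefix.
    if payload.startswith("data:"):
        _, _, payload = payload.partition(",")
    try:
        # validate=True rejects characters outside the base64 alphabet.
        return base64.b64decode(payload, validate=True)
    except binascii.Error as exc:
        raise ValueError("payload is not valid base64") from exc


# Self-contained check using the 8-byte PNG file signature as sample data.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
sample = "data:image/png;base64," + base64.b64encode(PNG_SIGNATURE).decode("ascii")
image_bytes = decode_image_payload(sample)
print(image_bytes == PNG_SIGNATURE)
```

In a real client the decoded bytes would be written to a file opened in binary mode, for example `open("out.png", "wb")`.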

examples/online_serving/image_to_image/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Qwen-Image-Edit Online Serving
+# Image-To-Image
 
 This example demonstrates how to deploy Qwen-Image-Edit model for online image editing service using vLLM-Omni.

examples/online_serving/text_to_image/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Qwen-Image Online Serving
+# Text-To-Image
 
 This example demonstrates how to deploy Qwen-Image model for online image generation service using vLLM-Omni.
