Commit 6623c50

update
1 parent 7362544 commit 6623c50

1 file changed: +25 -23 lines changed

_posts/2025-12-19-vllm-omni-diffusion-cache-acceleration.md renamed to _posts/2025-12-18-vllm-omni-diffusion-cache-acceleration.md

Lines changed: 25 additions & 23 deletions
@@ -1,10 +1,10 @@
-___
+---
 layout: post
 title: "vLLM-Omni Diffusion Cache Acceleration"
 author: "vLLM-Omni Team"
 ---

-# Turbocharge Your Diffusion Inference: vLLM-Omni Integrates Cache-DiT and TeaCache
+# Turbocharge Your Diffusion Inference

 We are thrilled to announce a major performance update for **vLLM-Omni**.

@@ -31,7 +31,7 @@ vLLM-Omni now supports two distinct caching backends to suit your specific needs


 ### 2. TeaCache: Simple & Adaptive
-TeaCache offers a hook-based, adaptive caching mechanism. It monitors the difference between inputs and dynamically decides when to reuse the transformer computations from the previous timestep.
+TeaCache is implemented natively within vLLM-Omni, providing a hook-based, adaptive caching mechanism. It monitors the difference between inputs and dynamically decides when to reuse the transformer computations from the previous timestep.
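Note on the changed paragraph: the "monitor the input difference, then decide whether to reuse" behavior it describes can be illustrated with a small sketch. This is not vLLM-Omni's implementation. The class `AdaptiveCache`, the helper `rel_l1`, and the rule of accumulating the relative L1 distance until it crosses `rel_l1_thresh` are assumptions chosen for illustration, and the sketch caches the full output rather than whatever the real hook stores.

```python
from typing import Callable, List, Optional

Vector = List[float]


def rel_l1(current: Vector, previous: Vector) -> float:
    """Relative L1 distance: sum|cur - prev| / sum|prev|."""
    num = sum(abs(c - p) for c, p in zip(current, previous))
    den = sum(abs(p) for p in previous)
    return num / den if den > 0 else float("inf")


class AdaptiveCache:
    """Hypothetical cache that skips the transformer when inputs barely change."""

    def __init__(self, rel_l1_thresh: float = 0.2) -> None:
        self.rel_l1_thresh = rel_l1_thresh
        self.prev_input: Optional[Vector] = None
        self.cached_output: Optional[Vector] = None
        self.accumulated = 0.0   # input drift since the last full computation
        self.computed_steps = 0  # number of times the transformer actually ran

    def step(self, x: Vector, transformer: Callable[[Vector], Vector]) -> Vector:
        reuse = False
        if self.prev_input is not None and self.cached_output is not None:
            # Accumulate drift so many small changes eventually force a refresh.
            self.accumulated += rel_l1(x, self.prev_input)
            reuse = self.accumulated < self.rel_l1_thresh
        self.prev_input = list(x)
        if reuse:
            return self.cached_output
        # Drift too large (or first step): recompute and reset the accumulator.
        self.accumulated = 0.0
        self.cached_output = transformer(x)
        self.computed_steps += 1
        return self.cached_output


if __name__ == "__main__":
    cache = AdaptiveCache(rel_l1_thresh=0.2)
    double = lambda v: [2.0 * e for e in v]  # stand-in for a transformer pass
    for x in ([1.0, 1.0], [1.05, 1.05], [1.1, 1.1], [2.0, 2.0]):
        cache.step(x, double)
    print(cache.computed_steps)
```

In this toy run the two middle steps change the input only slightly, so they reuse the cached result, while the last step drifts far enough to trigger a recomputation. A larger threshold skips more steps at the cost of output fidelity.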

 ## Performance Benchmarks

@@ -43,20 +43,21 @@ We benchmarked these methods on NVIDIA H200 GPUs using **Qwen-Image** (1024x1024
 | **Qwen-Image** | **TeaCache** | `rel_l1_thresh=0.2` | 10.47s | **1.91x**|
 | **Qwen-Image** | **Cache-DiT** | DBCache + TaylorSeer | 10.8s | **1.85x**|

-<div style="display: flex; gap: 3%;">
-<div style="flex: 1;">
-![No Cache](/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/cat.png)
-<p style="text-align: center; margin-top: 8px;">no cache</p>
+<div style="display: flex; gap: 20px; justify-content: center; align-items: flex-start;">
+
+<div style="flex: 1; text-align: center;">
+<img src="/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/cat.png" alt="No Cache" style="max-width: 100%; height: auto;">
+<p style="margin-top: 8px;">No Cache</p>
 </div>

-<div style="flex: 1;">
-![TeaCache](/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/cat_tea_cache.png)
-<p style="text-align: center; margin-top: 8px;">TeaCache</p>
+<div style="flex: 1; text-align: center;">
+<img src="/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/cat_tea_cache.png" alt="TeaCache" style="max-width: 100%; height: auto;">
+<p style="margin-top: 8px;">TeaCache</p>
 </div>

-<div style="flex: 1;">
-![Cache-DiT](/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/cat_cache_dit.png)
-<p style="text-align: center; margin-top: 8px;">Cache-DiT</p>
+<div style="flex: 1; text-align: center;">
+<img src="/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/cat_cache_dit.png" alt="Cache-DiT" style="max-width: 100%; height: auto;">
+<p style="margin-top: 8px;">Cache-DiT</p>
 </div>
 </div>

@@ -69,20 +70,21 @@ For image editing tasks, Cache-DiT shines even brighter. On **Qwen-Image-Edit**,
 | **Qwen-Image-Edit** | **TeaCache** | `rel_l1_thresh=0.2` | 35.0s | **1.47x**|
 | **Qwen-Image-Edit** | **Cache-DiT** | DBCache + TaylorSeer | 21.6s | **2.38x**|

-<div style="display: flex; gap: 3%;">
-<div style="flex: 1;">
-![No Cache](/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/qwen_bear_base.png)
-<p style="text-align: center; margin-top: 8px;">no cache</p>
+<div style="display: flex; gap: 20px; justify-content: center; align-items: flex-start;">
+
+<div style="flex: 1; text-align: center;">
+<img src="/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/qwen_bear_base.png" alt="No Cache" style="max-width: 100%; height: auto;">
+<p style="margin-top: 8px;">No Cache</p>
 </div>

-<div style="flex: 1;">
-![TeaCache](/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/qwen_bear_tea_cache.png)
-<p style="text-align: center; margin-top: 8px;">TeaCache</p>
+<div style="flex: 1; text-align: center;">
+<img src="/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/qwen_bear_tea_cache.png" alt="TeaCache" style="max-width: 100%; height: auto;">
+<p style="margin-top: 8px;">TeaCache</p>
 </div>

-<div style="flex: 1;">
-![Cache-DiT](/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/qwen_bear_cache_dit.png)
-<p style="text-align: center; margin-top: 8px;">Cache-DiT</p>
+<div style="flex: 1; text-align: center;">
+<img src="/assets/figures/2025-12-19-vllm-omni-diffusion-cache-acceleration/qwen_bear_cache_dit.png" alt="Cache-DiT" style="max-width: 100%; height: auto;">
+<p style="margin-top: 8px;">Cache-DiT</p>
 </div>
 </div>
