Commit 2dd7072

Cong Chen committed
Add reference links to vLLM Router blog post
- Add link to llm-d GitHub repository
- Add link to Kubernetes Service networking documentation

Signed-off-by: Cong Chen <[email protected]>
1 parent 63a684b commit 2dd7072

1 file changed: +2 -2 lines changed

_posts/2025-12-13-vllm-router-release.md

Lines changed: 2 additions & 2 deletions
@@ -47,8 +47,8 @@ The vllm-router is built with production-grade features for maintaining high ava
We benchmarked the new vLLM Router against two widely used alternatives:

-* **[llm-d](https://github.com/llm-d):** A Kubernetes-native routing framework that utilizes default queue-aware load balancing.
-* **vLLM-native:** The standard K8s native load balancer, which employs a basic round-robin strategy. Crucially, this option is *not* aware of Prefill/Decode states, treating all pods as identical vLLM replicas.
+* **[llm-d](https://github.com/llm-d/llm-d):** A Kubernetes-native routing framework that utilizes default queue-aware load balancing.
+* **vLLM-native:** The standard [K8s native load balancer](https://kubernetes.io/docs/concepts/services-networking/), which employs a basic round-robin strategy. Crucially, this option is *not* aware of Prefill/Decode states, treating all pods as identical vLLM replicas.

**Note on Exclusion:** We excluded the vLLM built-in DP/EP coordinator—the recommended [External Load Balancing](https://docs.vllm.ai/en/stable/serving/data_parallel_deployment.html#external-load-balancing) solution for vLLM clusters—from the benchmark. Its throughput was only 1/8 of the others due to a known [performance issue](https://github.com/vllm-project/vllm/issues/24461).
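For readers comparing the two baselines named in the changed lines, the difference between round-robin and queue-aware selection can be sketched as follows. This is a minimal illustration only: the function names, pod names, and queue depths are hypothetical, and it is not the actual logic of llm-d, Kubernetes Services, or the vLLM Router.

```python
from itertools import cycle


def round_robin(pods):
    """Return an iterator that rotates through pods in fixed order.

    Load and Prefill/Decode role are ignored (hypothetical helper).
    """
    return cycle(pods)


def least_queued(queue_depths):
    """Return the pod with the fewest pending requests.

    Ties go to the pod listed first (hypothetical helper).
    """
    return min(queue_depths, key=queue_depths.get)


# Hypothetical pod names and queue depths, for illustration only.
pods = ["pod-a", "pod-b", "pod-c"]
rr = round_robin(pods)
rr_picks = [next(rr) for _ in range(4)]

queue_depths = {"pod-a": 5, "pod-b": 1, "pod-c": 3}
qa_pick = least_queued(queue_depths)

print(rr_picks)  # rotation wraps around after the last pod
print(qa_pick)   # the pod with the shortest queue
```

Under these assumed inputs, the round-robin picker keeps sending traffic to `pod-a` even though it has the deepest queue, while the queue-aware picker selects `pod-b`.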
