[Scheduler][Overlap] Skip decode phase for max_new_tokens=1 requests #67830

Triggered via pull request: February 3, 2026 11:15
Status: Failure
Total duration: 28s

pr-test.yml

on: pull_request
check-changes (9s)
call-gate / pr-gate (6s)
Matrix: Build Wheel
wait-for-stage-a
Matrix: Build Wheel Arm
jit-kernel-unit-test
stage-a-cpu-only
Matrix: multimodal-gen-test-1-gpu
Matrix: multimodal-gen-test-2-gpu
sgl-kernel-unit-test
sgl-kernel-mla-test
sgl-kernel-benchmark-test
stage-a-test-1
sgl-kernel-b200-test
wait-for-stage-b
Matrix: stage-b-test-large-1-gpu
Matrix: stage-b-test-large-2-gpu
Matrix: stage-b-test-small-1-gpu
stage-b-test-4-gpu-b200
Matrix: stage-c-test-4-gpu-b200
Matrix: stage-c-test-4-gpu-h100
Matrix: stage-c-test-8-gpu-h20
Matrix: stage-c-test-8-gpu-h200
stage-c-test-deepep-4-gpu (0s)
stage-c-test-deepep-8-gpu-h200 (0s)
stage-c-test-large-4-gpu (0s)
stage-c-test-4-gpu-gb200 (0s)
stage-c-test-large-4-gpu-b200 (0s)
pr-test-finish (2s)

Annotations

2 errors
call-gate / pr-gate: Process completed with exit code 1.
pr-test-finish: Process completed with exit code 1.