Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[DOC] Fix path of v1 related figures documentation Improvements or additions to documentation tpu Related to Google TPUs
#21868 opened Jul 29, 2025 by heheda12345 Loading…
3 of 4 tasks
[ci] add b200 test ci/build
#21866 opened Jul 29, 2025 by simon-mo Loading…
4 tasks
[BugFix] Fix interleaved sliding window not set for Gemma3n ready ONLY add when PR is ready to merge/full CI is needed
#21863 opened Jul 29, 2025 by sarckk Loading…
3 of 4 tasks
[Test] Add Benchmark and Unit Test for per_token_group_quant performance Performance-related issues
#21860 opened Jul 29, 2025 by yewentao256 Loading…
add autotune pass v1
#21859 opened Jul 29, 2025 by wenscarl Draft
4 tasks
[Docs] Update docker.md with HF_TOKEN, new model, and podman fix documentation Improvements or additions to documentation force-merge
#21856 opened Jul 29, 2025 by mgoin Loading…
[Docs] Improve docs search experience by limiting code block height in search results documentation Improvements or additions to documentation
#21853 opened Jul 29, 2025 by mgoin Loading…
[Docs] Switch to better markdown linting pre-commit hook ci/build documentation Improvements or additions to documentation performance Performance-related issues
#21851 opened Jul 29, 2025 by hmellor Loading…
Revert "[AMD][CI/Build] Fix the AMD issue caused by inappropriate of symbol exposure (#21647)" ci/build ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#21850 opened Jul 29, 2025 by gshtras Loading…
[WIP] Add Kimi-Audio integration for vLLM new-model Requests to new models
#21849 opened Jul 29, 2025 by HelloWorldU Loading…
[Hardware][CPU] Build fix for ARM without BF16
#21848 opened Jul 29, 2025 by ericcurtin Loading…
Migrate MiniCPMOAudioInputs to TensorSchema
#21847 opened Jul 29, 2025 by bbeckca Loading…
Migrate LlavaOnevisionMultiInputs to TensorSchema
#21844 opened Jul 29, 2025 by bbeckca Loading…
Migrate LlavaNextVideoPixelInputs to TensorSchema
#21843 opened Jul 29, 2025 by bbeckca Loading…
[Bugfix] Fixing bug inside MultiModalProfiler. llama Related to Llama models multi-modality Related to multi-modality (#4194)
#21842 opened Jul 29, 2025 by shenoyvvarun Loading…
[Bugfix][PD] set max_completion_tokens=1 if req has this value documentation Improvements or additions to documentation
#21841 opened Jul 29, 2025 by Abirdcfly Loading…
3 of 4 tasks
[Bugfix] Actually disable processing cache when API server is scaled out frontend ready ONLY add when PR is ready to merge/full CI is needed
#21839 opened Jul 29, 2025 by DarkLight1337 Loading…
1 of 4 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.