Skip to content

Pull requests: vllm-project/vllm-gaudi

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Qwen2.5 vl no alignment
#698 opened Dec 8, 2025 by slokesha Draft
Add 0.12.0 release notes documentation Improvements or additions to documentation skip-gaudi-tests
#694 opened Dec 5, 2025 by mhelf-intel Loading…
Fix the docker image path documentation Improvements or additions to documentation skip-gaudi-tests
#691 opened Dec 5, 2025 by mhelf-intel Loading…
Enable inc dynamic quant for MoE models
#688 opened Dec 4, 2025 by mandy-li Loading…
Add vLLM UBI Dockerfile for Gaudi with RHEL 9.6
#686 opened Dec 4, 2025 by ghandoura Loading…
Reduce defrag operations in non-apc runs
#685 opened Dec 4, 2025 by kamil-kaczor Loading…
Add support for chunked attention (#597)
#683 opened Dec 4, 2025 by jkaniecki Loading…
Add support for chunked attention (#597)
#682 opened Dec 4, 2025 by jkaniecki Loading…
DP: dispatch tensor in FusedMoEMethod
#680 opened Dec 4, 2025 by xinyu-intel Loading…
Initiate CI with libfabric backend
#679 opened Dec 4, 2025 by amathewc Loading…
Add local path option for hf_cache
#662 opened Dec 1, 2025 by PatrykWo Loading…
Optimize MoE via chunk settings
#658 opened Nov 28, 2025 by xinyu-intel Loading…
CustomOp: grouped topk
#647 opened Nov 27, 2025 by xinyu-intel Loading…
make mla weight contiguous
#646 opened Nov 27, 2025 by xinyu-intel Loading…
bucket: add query len 1 to prefill bucket
#645 opened Nov 27, 2025 by xinyu-intel Loading…
Hybrid KV cache for hpu
#644 opened Nov 26, 2025 by michalkuligowski Draft
Fix filter for edge case & prefill bs > 1
#634 opened Nov 26, 2025 by adobrzyn Loading…
ProTip! Exclude everything labeled bug with -label:bug.