-
Notifications
You must be signed in to change notification settings - Fork 451
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[bugfix][torchair] fix wasted NPU memory buffer allocation for quantized deepseek with unquantized MTP layer
ready
read for review
ready-for-test
start test by label for PR
#3068
opened Sep 20, 2025 by
linfeng-yuan
Loading…
[CI] Upgrade vLLM to 20250920 (c60e613) and address config break
module:tests
ready
read for review
ready-for-test
start test by label for PR
vllm-break
[bugfix][torchair] fix kv_nz accuracy problem and remove redundant reshape_and_cache operation
ready
read for review
ready-for-test
start test by label for PR
#3066
opened Sep 20, 2025 by
linfeng-yuan
Loading…
【long_seq_optim】update main
ci/build
documentation
Improvements or additions to documentation
module:core
module:ops
module:quantization
module:tests
#3065
opened Sep 20, 2025 by
LookAround0301
Loading…
[Feature] qwen3 qk norm support multi_stream.
module:core
#3060
opened Sep 20, 2025 by
weijinqian0
Loading…
flashcomm1 for shared fused moe
module:core
module:ops
#3058
opened Sep 20, 2025 by
zhaozx-cn
Loading…
Test on guide decoding
merge-conflicts
module:core
module:ops
module:quantization
module:tests
ready-for-test
start test by label for PR
Mmkwarg test
merge-conflicts
module:core
module:ops
module:quantization
module:tests
ready-for-test
start test by label for PR
#3050
opened Sep 20, 2025 by
MengqingCao
•
Draft
[Feature] Reduce host memory usage for attention mask generation
ready
read for review
ready-for-test
start test by label for PR
#3048
opened Sep 20, 2025 by
jianzs
Loading…
[Bugfix][LoRA] Fix LoRA bug after supporting Qwen3-Next
ready
read for review
ready-for-test
start test by label for PR
#3044
opened Sep 19, 2025 by
paulyu12
Loading…
[Structured Output][CI] Update structured output config to sync with upstream
merge-conflicts
module:core
module:tests
#3032
opened Sep 19, 2025 by
shen-shanshan
Loading…
splitting MTP into graph mode and non-graph mode
merge-conflicts
#3030
opened Sep 19, 2025 by
weisirui-eng
Loading…
[Bugfix] Eliminate the redundant handling of index_select in get_splitfuse_att…
#3028
opened Sep 19, 2025 by
tt545571022
Loading…
fix: explicitly setting the tensor shape of otp output to fix shape i…
module:ops
#3027
opened Sep 19, 2025 by
zzhx1
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.