-
Notifications
You must be signed in to change notification settings - Fork 597
Pull requests: PaddlePaddle/FastDeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feature][SpeculativeDecoding]Support tree-attention
#3514
opened Aug 21, 2025 by
freeliuzc
Loading…
[Excutor] Fixed the issue of CUDA graph execution failure caused by d…
#3512
opened Aug 21, 2025 by
gongshaotian
Loading…
[fix] fix output tokens count in streaming completion api
contributor
External developers
#3507
opened Aug 21, 2025 by
liyonghua0910
Loading…
[MetaxGPU] Adapt to the latest fastdeploy on metax gpu
contributor
External developers
#3492
opened Aug 20, 2025 by
Kane2011
Loading…
is_tensor_stream_capturing instead cudaStreamIsCapturing
#3487
opened Aug 20, 2025 by
zhink
Loading…
[CudaGraph] [SOT] Support spliting static graph into piecewise graph with cuda_graph
#3478
opened Aug 19, 2025 by
zyfncg
Loading…
【BugFix】completion接口echo回显支持
contributor
External developers
#3477
opened Aug 19, 2025 by
AuferGachet
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.