Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix: Eagle decoding
#3456 opened Apr 10, 2025 by Funatiq Loading…
feat/loraOp
#3455 opened Apr 10, 2025 by danielafrimi Loading…
test: Add DeepSeek-V3-Lite PP=4 cases
#3454 opened Apr 10, 2025 by syuoni Loading…
feat: draft/Lora op
#3453 opened Apr 10, 2025 by danielafrimi Draft
chore: add dgx_h200 tests
#3451 opened Apr 10, 2025 by yiqingy0 Draft
chore: Unify Python NVTX call
#3450 opened Apr 10, 2025 by kaiyux Loading…
fix: Fix PP for llama.
#3449 opened Apr 10, 2025 by yuxianq Loading…
fix: Fix the issues related to fused moe path.
#3435 opened Apr 10, 2025 by hyukn Loading…
Do not merge, internal
#3431 opened Apr 9, 2025 by milesial Draft
feat: Nemotron-H model support
#3430 opened Apr 9, 2025 by vegaluisjose Loading…
chore: unify pp_layers helpers
#3429 opened Apr 9, 2025 by achartier Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.