Pull requests: NVIDIA/Megatron-LM
[Dev] [Draft] FP8 params support for megatron-fsdp
#2086
opened Nov 2, 2025 by
kunlunl
6 tasks
Fix ambiguous tensor truth-value check in train_rl.loss_func (use .it…
#2085
opened Nov 2, 2025 by
vignesh1507
Fix NameError in pretrain_retro.py (add import_module), remove unused…
#2084
opened Nov 2, 2025 by
vignesh1507
Records time to first token in the dynamic inference coordinator
#2081
opened Nov 1, 2025 by
sidsingh-nvidia
Draft
6 tasks
Add extra RL files
Final Review
Apply this label to indicate that your PR is ready for final review.
Add BytesIO to safe_globals
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Make PipelineParallelLayout always return str from __repr__
Final Review
Hybrid Data x Context Parallelism Feature
enhancement
New feature or request
Expert Review
M4 + Dist Checkpoint: Replace global parallel state with explicit group parameters
#2053
opened Oct 30, 2025 by
dimapihtar
Draft
6 tasks