Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Revert "[dev] Add assertion for mxfp8 params without dp overlap (#2270)" dev2main: mbridge dev to main: this PR is needed in main for mbridge Final Review Apply this label to indicate that your PR is ready for final review.
#2901 opened Jan 10, 2026 by ko3n1g Loading…
6 tasks
Core 0.16
Cherrypick 1989
#2900 opened Jan 10, 2026 by ko3n1g Draft
6 tasks
Core 0.16
Ko3n1g/chore/stack prs 260110
#2899 opened Jan 10, 2026 by ko3n1g Loading…
6 tasks
Core 0.16
Remove unused FlashAttention3 args
#2898 opened Jan 10, 2026 by santhnm2 Loading…
6 tasks
Core 0.16
Maanug/transfcfg generated args v2
#2896 opened Jan 10, 2026 by maanug-nv Draft
6 tasks
[Megatron-FSDP] Test FP8 activations + parameter sharding with Megatron-FSDP fully-shard. Update README. Final Review Apply this label to indicate that your PR is ready for final review.
#2894 opened Jan 10, 2026 by cspades Loading…
3 of 6 tasks
Core 0.16
Use different token for assign logic
#2893 opened Jan 9, 2026 by Phlip79 Loading…
6 tasks
Core 0.16
Support custom Router implementations in MoELayer community-request
#2891 opened Jan 9, 2026 by nschank Loading…
2 of 6 tasks
ci: Remove Github transition comment from CI
#2881 opened Jan 8, 2026 by chtruong814 Loading…
6 tasks
Core 0.16
Test on Muon integration Run CICD
#2880 opened Jan 8, 2026 by BoxiangW Loading…
6 tasks
Core 0.16
RL refit pipelining support
#2878 opened Jan 8, 2026 by wdykas Loading…
6 tasks
Core 0.16
handle 'step' state in checkpoint save
#2874 opened Jan 8, 2026 by ahmadki Draft
6 tasks
Add a logprobs test with real gpt model. Expert Review Apply this label to indicate that your PR is ready for expert review. Run tests
#2870 opened Jan 8, 2026 by yobibyte Loading…
6 tasks
Core 0.16
[WIP Feat] Split-K Indexer Kernels community-request needs-follow-up Issue needs follow-up
#2869 opened Jan 8, 2026 by laixinn Draft
7 of 17 tasks
Only build datasets on the required ranks
#2865 opened Jan 8, 2026 by asolergi-nv Loading…
Update Slack user group when oncall changes
#2859 opened Jan 8, 2026 by Phlip79 Loading…
6 tasks
Core 0.16
ProTip! What’s not been updated in a month: updated:<2025-12-11.