Added the megatron-bridge tool to support loading and saving HF files. #1007

chai-xiaonan · 2025-12-31T08:15:06Z

Nemo-Bridge has been adapted to FlagScale, so FlagScale can now save and load checkpoints in the HF safetensors format. This was verified on the Qwen3-0.6B, Deepseek v3-16_a3B, and Qwen-32B models: saving and loading in the HF safetensors format worked without issues, and accuracy was correct.

lxd-cumt · 2026-01-07T02:17:57Z

Please add an argument, `hf-save-steps`, to control how often a Hugging Face checkpoint is saved during training.
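A minimal sketch of what such an argument could look like, assuming a plain `argparse` parser. The helper names `add_hf_save_args` and `should_save_hf`, the `None` default, and the "save every N iterations" semantics are assumptions for illustration, not code from this PR:

```python
import argparse
from typing import Optional


def add_hf_save_args(parser: argparse.ArgumentParser) -> argparse.ArgumentParser:
    # Hypothetical flag: the name comes from the review comment above,
    # the default and meaning are assumed for this sketch.
    parser.add_argument(
        "--hf-save-steps",
        type=int,
        default=None,
        help="Save a Hugging Face checkpoint every N iterations (disabled if unset).",
    )
    return parser


def should_save_hf(iteration: int, hf_save_steps: Optional[int]) -> bool:
    # Disabled when the flag is unset or non-positive.
    if hf_save_steps is None or hf_save_steps <= 0:
        return False
    # Save only on positive multiples of the interval.
    return iteration > 0 and iteration % hf_save_steps == 0


parser = add_hf_save_args(argparse.ArgumentParser())
args = parser.parse_args(["--hf-save-steps", "500"])

# Iterations in a 2000-step run at which an HF checkpoint would be written.
saved_at = [it for it in range(1, 2001) if should_save_hf(it, args.hf_save_steps)]
print(saved_at)  # [500, 1000, 1500, 2000]
```

Keeping the interval check in a small pure function makes it easy to call from the training loop next to the existing checkpoint-interval logic and to test in isolation.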

lxd-cumt · 2026-01-08T08:53:38Z

Please remove the patches and open a PR against Megatron-LM-FL for the megatron/core-related modifications.

Added the megatron-bridge tool to support loading and saving HF files.

66dc05f

chai-xiaonan requested review from aoyulong, heavyrain-lzy and zhaoyinglia as code owners December 31, 2025 08:15
