add nemo_bridge #1050
Conversation
```python
# Load the HF model from config
config_load = args.hf_config_path
config = safe_load_config_with_retry(config_load, trust_remote_code=False)
bridge = AutoBridge.from_hf_config(config)
```
Will this save-ckpt step allocate extra GPU memory when initializing an HF model?
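(For context: if the bridge only needs the architecture, an HF model can be instantiated on PyTorch's meta device so that no parameter memory, GPU or CPU, is allocated. A minimal sketch using standard transformers APIs; whether `AutoBridge.from_hf_config` does this internally is exactly what this question asks.)

```python
import torch
from transformers import AutoConfig, AutoModelForCausalLM

# Build only the model skeleton: on the meta device, parameters carry
# shape/dtype metadata but no real storage, so no extra GPU memory is used.
config = AutoConfig.from_pretrained("Qwen/Qwen3-0.6B")  # example model from this PR
with torch.device("meta"):
    hf_model = AutoModelForCausalLM.from_config(config)

# All parameters live on the meta device (no data allocated).
assert all(p.device.type == "meta" for p in hf_model.parameters())
```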
```python
bridge.load_hf_weights(ddp_model)
# no optimizer state to restore
iteration = 0
num_floating_point_operations_so_far = 0
```
Please add a print_rank_0 call here.
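A minimal sketch of the suggested change, assuming `print_rank_0` is Megatron-LM's rank-0 logging helper (the exact import path in FlagScale's tree may differ):

```python
from megatron.training import print_rank_0  # assumed import path

bridge.load_hf_weights(ddp_model)
print_rank_0("Loaded HF weights via nemo_bridge; no optimizer state restored, "
             "resetting iteration counters to 0.")
iteration = 0
num_floating_point_operations_so_far = 0
```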
```python
# use megatron bridge
from megatron.nemo_bridge.models import AutoBridge

bridge = AutoBridge.from_hf_pretrained(load_dir)
bridge.load_hf_weights(ddp_model)
```
Can nemo-bridge’s load_hf_weights handle a ddp_model directly, where ddp_model is wrapped by DistributedDataParallel?
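If it cannot, a common workaround is to unwrap the DDP container first. A hedged sketch; `unwrap_ddp` is a hypothetical helper, and Megatron's own DDP wrapper would need similar handling since it also exposes the wrapped module via `.module`:

```python
import torch

def unwrap_ddp(model):
    """Return the underlying module if model is wrapped by DistributedDataParallel."""
    if isinstance(model, torch.nn.parallel.DistributedDataParallel):
        return model.module
    return model

# Usage with the hunk above:
bridge.load_hf_weights(unwrap_ddp(ddp_model))
```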
```
@@ -0,0 +1,8 @@
# Copyright (c) 2025, BAAI. All rights reserved.
```
NeMo Megatron-Bridge supports installation via pip (ref: https://pypi.org/project/megatron-bridge/). Please remove the vendored source code.
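For illustration, a sketch of depending on the published package instead of vendoring it; the hub ID is just an example, and `from_hf_pretrained` mirrors the call already used in this PR:

```python
# pip install megatron-bridge
from megatron.bridge import AutoBridge

# Load a bridge directly from a Hugging Face checkpoint directory or hub ID.
bridge = AutoBridge.from_hf_pretrained("Qwen/Qwen3-0.6B")
```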
```
@@ -0,0 +1,8 @@
# Copyright (c) 2025, BAAI. All rights reserved.
```
Rename `flagscale/train/megatron/nemo_bridge` to `flagscale/train/megatron/bridge` so that it matches the import pattern `from megatron.bridge`.
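Illustratively, the rename would make the vendored import line up with upstream (assumed layout):

```python
# Before (this PR's vendored layout):
from megatron.nemo_bridge.models import AutoBridge
# After the proposed rename, matching upstream Megatron-Bridge:
from megatron.bridge.models import AutoBridge
```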
tengqm left a comment:
When copy-pasting source code from other repos, we are obliged to preserve their copyright notice as well; we cannot claim copyright over this code. The original code has the following copyright header, which must be preserved:
```
# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
```

```
@@ -0,0 +1,110 @@
# Copyright (c) 2025, BAAI. All rights reserved.
#
# Copied from: https://github.com/NVIDIA-NeMo/Megatron-Bridge
```
If Megatron-Bridge has a copyright claim, we are supposed to paste their copyright statements here.
Reconstructs Nemo-Bridge on top of the restructured FlagScale version. FlagScale now supports part of nemo-bridge's functionality, enabling the framework to load and save checkpoints in HF format during training. This version also adds a new feature: the number of iterations between HF weight saves can be set via save_hf_interval. Accuracy has been verified to be correct for Deepseek V3 16_a3B, Qwen3-32B, and Qwen3-0.6B.
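A hedged sketch of how the interval-gated HF save could look inside the training loop; the option name save_hf_interval comes from this description, while `maybe_save_hf` and `save_hf_pretrained` are illustrative (the actual FlagScale/bridge API may differ):

```python
def maybe_save_hf(args, iteration, bridge, model):
    """Save HF-format weights every `args.save_hf_interval` iterations (sketch)."""
    if args.save_hf_interval and iteration % args.save_hf_interval == 0:
        # Hypothetical export call: write the Megatron model back out in HF format.
        bridge.save_hf_pretrained(model, f"{args.save}/hf_iter_{iteration:07d}")
```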