12 changes: 2 additions & 10 deletions tutorials/examples/train_hypergrid.py
@@ -807,7 +807,7 @@ def _model_builder() -> Tuple[GFlowNet, torch.optim.Optimizer]:
gflownet = gflownet.to(device)

n_iterations = ceil(args.n_trajectories / args.batch_size)
-per_node_batch_size = args.batch_size // distributed_context.world_size
+per_node_batch_size = args.batch_size // distributed_context.num_training_ranks
Collaborator comment: good catch!

modes_found = set()
# n_pixels_per_mode = round(env.height / 10) ** env.ndim
# Note: on/off-policy depends on the current strategy; recomputed inside the loop.
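Note on this hunk: the split only does what the comment suggests if `distributed_context.num_training_ranks` counts just the ranks that actually sample and train. A minimal sketch of the intended arithmetic; every name below (including the buffer-rank scenario) is an illustrative assumption, not the tutorial's actual API:

```python
def per_rank_batch_size(global_batch_size: int, num_training_ranks: int) -> int:
    """Share of the global batch each training rank should sample.

    Dividing by the number of *training* ranks rather than world_size matters
    whenever the job also launches non-training ranks (e.g. a dedicated
    replay-buffer rank): those ranks sample nothing, so counting them would
    shrink the effective global batch below args.batch_size.
    """
    return global_batch_size // num_training_ranks

# Illustrative numbers: world_size = 5 (4 training ranks + 1 buffer rank).
assert per_rank_batch_size(1024, 4) == 256   # 4 * 256 == 1024, as intended
assert per_rank_batch_size(1024, 5) == 204   # 4 * 204 == 816, under-sampled
```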
@@ -828,14 +828,6 @@ def _model_builder() -> Tuple[GFlowNet, torch.optim.Optimizer]:
)
prof.start()

-    if args.distributed:
-        # Create and start error handler.
-        def cleanup():
-            logger.info("Process %d: Cleaning up...", rank)
-
-        rank = torch.distributed.get_rank()
-        torch.distributed.get_world_size()
-
# Initialize some variables before the training loop.
timing = {}
time_start = time.time()
@@ -897,7 +889,7 @@ def cleanup():
)
trajectories = gflownet.sample_trajectories(
env,
-n=args.batch_size,
+n=per_node_batch_size,
save_logprobs=is_on_policy_iter, # Reuse on-policy log-probs.
save_estimator_outputs=not is_on_policy_iter, # Off-policy caches estimator outputs.
epsilon=float(getattr(args, "agent_epsilon", 0.0)),
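For the `n=per_node_batch_size` change above, a toy check of the per-iteration trajectory count, again treating `num_training_ranks` and the concrete numbers as assumptions for illustration:

```python
batch_size = 1024           # args.batch_size: global trajectories per iteration
num_training_ranks = 4      # assumed distributed_context.num_training_ranks
per_node_batch_size = batch_size // num_training_ranks  # 256

# Each rank now samples only its own share, so the global count per iteration
# stays at batch_size: 4 * 256 == 1024.  The old call with n=args.batch_size
# had every rank sample the full batch, i.e. 4 * 1024 == 4096 trajectories,
# inflating the effective batch by a factor of num_training_ranks.
assert per_node_batch_size * num_training_ranks == batch_size
```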