
Conversation

@lbluque (Contributor) commented Dec 18, 2025

The next ray.serve release will include the option to use a function to determine batch sizes based on input data (ray-project/ray#59059).

This PR updates our code to use the total number of atoms to determine batch sizes in the BatchPredictServer.

@lbluque lbluque added the enhancement (New feature or request) and patch (Patch version release) labels Dec 18, 2025
@meta-cla meta-cla bot added the cla signed label Dec 18, 2025
@lbluque lbluque marked this pull request as draft December 18, 2025 23:43
@lbluque lbluque marked this pull request as ready for review January 6, 2026 20:59
@lbluque lbluque requested a review from rayg1234 January 6, 2026 21:00
@lbluque lbluque requested review from kjmichel and mshuaibii January 8, 2026 18:43

- @serve.batch
+ @serve.batch(
+     batch_size_fn=lambda batch: sum(sample.natoms.sum() for sample in batch).item()
Contributor


Naive question: how is @serve.batch working here? How does batch_size_fn get incorporated?

Contributor Author


Great question! That's all implemented in Ray. The TL;DR is that the serve.batch decorator instantiates a BatchQueue class that receives the batch_size_fn.

If you're curious you can see the implementation of the decorator here: https://github.com/ray-project/ray/blob/d4817998ee8476c138e9106280ecefdf1e59ba6b/python/ray/serve/batching.py#L677
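To make the idea concrete, here is a toy, pure-Python sketch of how a queue could use a user-supplied batch_size_fn to decide when a batch is full. This is not Ray's actual BatchQueue; the class and method names are illustrative only.

```python
# Illustrative sketch only -- NOT Ray's actual BatchQueue implementation.
# It shows the core idea: the queue accumulates requests and flushes a batch
# once a user-supplied batch_size_fn reports the batch has reached capacity.
from typing import Any, Callable, List, Optional


class ToyBatchQueue:
    def __init__(
        self,
        max_batch_size: int,
        batch_size_fn: Callable[[List[Any]], int],
    ) -> None:
        self.max_batch_size = max_batch_size
        self.batch_size_fn = batch_size_fn
        self._pending: List[Any] = []

    def put(self, request: Any) -> Optional[List[Any]]:
        """Enqueue a request; return a full batch once batch_size_fn says so."""
        self._pending.append(request)
        if self.batch_size_fn(self._pending) >= self.max_batch_size:
            batch, self._pending = self._pending, []
            return batch
        return None


# With batch_size_fn=len this reduces to ordinary count-based batching.
queue = ToyBatchQueue(max_batch_size=3, batch_size_fn=len)
assert queue.put("a") is None
assert queue.put("b") is None
assert queue.put("c") == ["a", "b", "c"]
```

Swapping in a size function like the atom-count lambda in this PR turns the same machinery into capacity-based batching.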

@lbluque lbluque requested a review from mshuaibii January 9, 2026 20:54
  self,
  predict_unit: MLIPPredictUnit,
- max_batch_size: int = 16,
+ max_batch_size: int = 512,
Contributor


It looks like this went from 16 -> 512 here and 32 -> 512 in _batch_serve.py.

Is this interpreted differently now? Before batch size meant the number of structures in the batch, but is it now compared directly to the output of batch_size_fn (which is the number of atoms across all structures)?

Contributor Author


Yes, that's correct! The default batch size is now 512 total atoms across all structures in the batch. This is approximate, though: a batch is cut as soon as its total number of atoms is larger than 512.
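The batching semantics described above can be sketched as a greedy split: structures accumulate until the running atom total passes the limit, so a batch can slightly overshoot it. This is an illustrative sketch of the behavior, not the code in this PR; the function name is hypothetical.

```python
# Illustrative sketch of the atom-count batching semantics (not the PR's code).
# A batch is closed as soon as its running atom total exceeds max_atoms, so the
# last structure added can push the batch somewhat past the limit.
from typing import List


def split_into_batches(
    natoms_per_structure: List[int], max_atoms: int = 512
) -> List[List[int]]:
    batches: List[List[int]] = []
    current: List[int] = []
    total = 0
    for natoms in natoms_per_structure:
        current.append(natoms)
        total += natoms
        if total > max_atoms:  # batch breaks once the total is larger than the cap
            batches.append(current)
            current, total = [], 0
    if current:  # flush any leftover structures
        batches.append(current)
    return batches


# Two 300-atom structures total 600 > 512, so the first batch closes there.
assert split_into_batches([300, 300, 100]) == [[300, 300], [100]]
```

This is why the limit is "approximate": measured in atoms, batches land near 512 rather than exactly at it.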

@lbluque lbluque requested a review from kjmichel January 10, 2026 00:10

Labels

cla signed, enhancement (New feature or request), patch (Patch version release)

4 participants