
Feat: support ONNX exportation and loading#103

Draft
shuheng-liu wants to merge 7 commits into main from feat/onnx-export

Conversation

@shuheng-liu
Collaborator

What this does

Explain what this PR does. Feel free to tag your PR with the appropriate label(s).

Examples:

  • Fixes #[issue] (🐛 Bug)
  • Adds new Feature (🗃️ Feature)
  • Optimizes something (⚡️ Performance)
  • Updates docs (📝 Documentation)

How it was tested

Explain/show how you tested your changes.

Examples:

  • Added test_something in tests/test_stuff.py.
  • Added new_feature and checked that training converges with policy X on dataset/environment Y.
  • Optimized some_function; it now runs X times faster than before.

How to checkout & try? (for the reviewer)

Provide a simple way for the reviewer to try out your changes.

Examples:

pytest -sx tests/test_stuff.py::test_something
python src/opentau/scripts/train.py --some.option=true
opentau-train --config_path=configs/examples/pi05_training_config.json

Checklist

  • I have added Google-style docstrings to important functions and ensured function parameters are typed.
  • My PR includes policy-related changes.
    • If the above is checked: I have run the GPU pytests (pytest -m "gpu") and regression tests.

Note: Before submitting this PR, please read the contributor guideline.

Copilot AI review requested due to automatic review settings February 4, 2026 22:39
Copilot AI (Contributor) left a comment


Pull request overview

This PR adds ONNX export and inference capabilities for the PI05 policy model. The implementation separates traceable neural network operations (exported to ONNX) from non-traceable operations like tokenization and state discretization (handled externally in Python).

Changes:

  • Adds onnx_inference.py script for running inference with exported ONNX models
  • Refactors export_to_onnx.py to support PI05 using the Dynamo ONNX exporter with pre-tokenized inputs
  • Updates dtype handling in PI05 model files to use float32 during ONNX export instead of bfloat16

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 8 comments.

Summary per file:

  • src/opentau/scripts/onnx_inference.py: New inference script for ONNX models with pre-tokenization, image preprocessing, and action sampling
  • src/opentau/scripts/export_to_onnx.py: Refactored to export PI05 models with pre-tokenized inputs using the Dynamo exporter
  • src/opentau/policies/pi05/paligemma_with_expert.py: Fixed the Cache import from transformers; added dtype selection for ONNX export; guards bfloat16 casting during compilation
  • src/opentau/policies/pi05/modeling_pi05.py: Added a dtype selection function for ONNX export compatibility
  • src/opentau/policies/normalize.py: Skips validation assertions during ONNX export and torch compilation


Returns:
Action tensor of shape (batch, n_action_steps, action_dim).
"""
print("Starting forward pass of the wrapper...")

Copilot AI Feb 4, 2026


Debug print statements should be removed or replaced with proper logging. These print statements will be executed during ONNX export tracing, which may clutter the output. Consider using logging.debug() instead.
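The suggested replacement could look like this (a minimal sketch; the wrapper method itself is not shown):

```python
import logging

# Module-level logger, as is conventional in library code.
logger = logging.getLogger(__name__)

# Emit at DEBUG level instead of print(): the message is suppressed by
# default and only appears when the user opts in, e.g. via
# logging.basicConfig(level=logging.DEBUG).
logger.debug("Starting forward pass of the wrapper...")
```

Because the message is gated on the logger's level, it also stays out of the console during ONNX export tracing unless explicitly enabled.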

# `policy.model.forward` returns a (batch_size, n_action_steps, action_dim) tensor, but the queue
# effectively has shape (n_action_steps, batch_size, *), hence the transpose.
actions = actions.transpose(0, 1)
print("Finished forward pass of the wrapper")

Copilot AI Feb 4, 2026


Debug print statement should be removed or replaced with proper logging. Consider using logging.debug() instead.

Comment on lines +116 to +118
# `policy.model.forward` returns a (batch_size, n_action_steps, action_dim) tensor, but the queue
# effectively has shape (n_action_steps, batch_size, *), hence the transpose.
actions = actions.transpose(0, 1)

Copilot AI Feb 4, 2026


The comment states "policy.model.forward returns a (batch_size, n_action_steps, action_dim) tensor" but this is inside the forward method of the wrapper, and the code is calling policy.model.sample_actions, not policy.model.forward. The comment appears to be outdated or incorrect. Additionally, the transpose operation converts from (batch, n_action_steps, action_dim) to (n_action_steps, batch, action_dim), which contradicts the ONNX export that declares the output as "actions" without specifying this transposition in the documentation.
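The shape change the comment refers to can be seen directly (the sizes below are illustrative; the real batch size and action dimensions come from the policy config):

```python
import torch

# (batch, n_action_steps, action_dim), as returned by sample_actions.
actions = torch.zeros(2, 50, 32)

# The wrapper then transposes, so the exported "actions" output is
# actually (n_action_steps, batch, action_dim).
transposed = actions.transpose(0, 1)
print(tuple(transposed.shape))  # → (50, 2, 32)
```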

Comment on lines +105 to +109
providers = provider or (
["CUDAExecutionProvider", "CPUExecutionProvider"]
if ort.get_device() == "GPU"
else ["CPUExecutionProvider"]
)

Copilot AI Feb 4, 2026


The provider parameter is a string but is assigned directly to providers which expects a list. When a user passes a string (e.g., "CUDAExecutionProvider"), it will be used as-is instead of being wrapped in a list. This should be: providers = [provider] if provider else (...)

*images: Variable number of image tensors, each of shape (batch, 3, H, W).

Returns:
Action tensor of shape (batch, n_action_steps, action_dim).

Copilot AI Feb 4, 2026


The docstring states the return shape is (batch, n_action_steps, action_dim), but the actual return shape after the transpose on line 118 is (n_action_steps, batch, action_dim). This inconsistency will confuse users of the exported ONNX model. The docstring should be updated to match the actual output shape.

Suggested change:
- Action tensor of shape (batch, n_action_steps, action_dim).
+ Action tensor of shape (n_action_steps, batch, action_dim).

@shuheng-liu shuheng-liu marked this pull request as draft February 4, 2026 22:57
@shuheng-liu shuheng-liu changed the title Feat: support exporting model to ONNX and load it Feat: support ONNX exportation and loading Feb 4, 2026
