Feat/value response by akshay18iitg · Pull Request #109 · TensorAuto/OpenTau

akshay18iitg · 2026-02-07T21:15:49Z

What this does

Adds a response output to the value function so that it can be co-trained on both robotic and VQA datasets.

This matters because, as noted in the pi*0.6 paper, co-training the value function on a VQA dataset helps avoid overfitting.
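To illustrate the idea of co-training with a response loss that tolerates empty responses, here is a minimal pure-Python sketch. It is not the code in this PR: the function names (`cross_entropy`, `response_loss`, `cotrain_loss`), the `response_weight` parameter, and the choice to return a zero response loss for samples with no response tokens are all assumptions made for illustration. The value loss is treated as an already-computed scalar.

```python
import math


def cross_entropy(logits, target):
    """Negative log-likelihood of `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def response_loss(token_logits, token_targets, token_mask):
    """Mean cross-entropy over response tokens whose mask is 1.

    Returns 0.0 when no token is marked as a response token, which is
    how this sketch models a sample with an empty response (assumption).
    """
    losses = [
        cross_entropy(logits, target)
        for logits, target, keep in zip(token_logits, token_targets, token_mask)
        if keep
    ]
    return sum(losses) / len(losses) if losses else 0.0


def cotrain_loss(value_loss, token_logits, token_targets, token_mask,
                 response_weight=1.0):
    """Combine a precomputed value loss with the masked response loss."""
    resp = response_loss(token_logits, token_targets, token_mask)
    return value_loss + response_weight * resp


# Sample with an empty response: only the value loss contributes.
empty_response_total = cotrain_loss(0.5, [[1.0, 2.0]], [0], [0])

# Sample with one response token over a two-way uniform prediction.
one_token_total = cotrain_loss(0.0, [[0.0, 0.0]], [0], [1])
```

In this sketch, a sample whose mask is all zeros adds nothing to the response term, so a batch can mix samples with and without responses without producing a division by zero.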

How it was tested

Ran training on a local desktop with a mixture of the LIBERO and PixMo datasets.

Examples:
opentau-train --config_path=configs/examples/value_config.json

How to checkout & try? (for the reviewer)

opentau-train --config_path=configs/examples/value_config.json

Checklist

I have added Google-style docstrings to important functions and ensured function parameters are typed.
My PR includes policy-related changes.
- If the above is checked: I have run the GPU pytests (pytest -m "gpu") and regression tests.

Note: Before submitting this PR, please read the contributor guideline.

akshay18iitg added 5 commits January 30, 2026 10:48

Adding response and handling empty response loss for value function

320aab5

Adding response and handling empty response loss for value function

255ef38

Add inference for value function

6a0ac8f

Supporting co-training for value function

dacbc35

Adding comments

7169224

akshay18iitg requested review from WilliamYue37 and shuheng-liu February 7, 2026 21:15

akshay18iitg self-assigned this Feb 7, 2026

akshay18iitg wants to merge 5 commits into main from feat/value_response
