feat(train_reward_model): add chatml formatting and aggregation of more statistics #21

Open
maxreciprocate wants to merge 2 commits into main from update-reward-trainer

Commits

Commits on Dec 7, 2023