gramesh-amd

Follow

gramesh-amd

Follow

0 followers · 2 following

Achievements

Achievements

Popular repositories Loading

slime slime Public

Forked from THUDM/slime

slime is a LLM post-training framework aiming for RL Scaling.

Python 1
miles_upstream miles_upstream Public

Forked from radixark/miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python