GAIR-NLP / OctoThinker

Revisiting Mid-training in the Era of RL Scaling
27Updated this week

Alternatives and similar repositories for OctoThinker:

Users that are interested in OctoThinker are comparing it to the libraries listed below