cmu-l3 / l1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
180Updated last month

Alternatives and similar repositories for l1:

Users that are interested in l1 are comparing it to the libraries listed below