satrams / rent-rlLinks

RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
28Updated 3 weeks ago

Alternatives and similar repositories for rent-rl

Users that are interested in rent-rl are comparing it to the libraries listed below

Sorting: