satrams / rent-rl

RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
20 · Updated last week
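
To make "entropy minimization" concrete, here is a minimal sketch of how an entropy-based reward could be computed from a model's next-token logits. It assumes the reward is the negative mean token-level entropy of the generated response, so more confident outputs score higher. The function names (`softmax`, `token_entropy`, `entropy_reward`) are hypothetical and are not taken from the rent-rl codebase.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def token_entropy(logits):
    """Shannon entropy (in nats) of the next-token distribution at one position."""
    probs = softmax(logits)
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def entropy_reward(per_token_logits):
    """Hypothetical reward: negative mean entropy over all generated positions.

    Assumption: lower entropy (higher confidence) should yield a higher reward.
    """
    entropies = [token_entropy(logits) for logits in per_token_logits]
    return -sum(entropies) / len(entropies)


if __name__ == "__main__":
    confident = [[10.0, 0.0, 0.0], [8.0, 0.0, 0.0]]
    uncertain = [[0.0, 0.0, 0.0], [0.1, 0.0, 0.0]]
    # The confident response receives the larger (less negative) reward.
    print(entropy_reward(confident) > entropy_reward(uncertain))
```

Because this reward depends only on the model's own output distribution, no labeled answers are needed, which is what makes such a signal unsupervised. In practice the reward would be passed to a reinforcement learning optimizer; that step is omitted here.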

Alternatives and similar repositories for rent-rl

Users who are interested in rent-rl are comparing it to the libraries listed below.
