princeton-pli / RLMT

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
118 · Updated 3 weeks ago

Alternatives and similar repositories for RLMT

Users who are interested in RLMT are comparing it to the libraries listed below.
