princeton-pli / RLMTLinks

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
90Updated last week

Alternatives and similar repositories for RLMT

Users that are interested in RLMT are comparing it to the libraries listed below

Sorting: