princeton-pli / RLMT
View external linksLinks

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
124Oct 27, 2025Updated 3 months ago

Alternatives and similar repositories for RLMT

Users that are interested in RLMT are comparing it to the libraries listed below

Sorting:

Are these results useful?