princeton-pli / RLMTView on GitHub
[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
127Oct 27, 2025Updated 5 months ago

Alternatives and similar repositories for RLMT

Users that are interested in RLMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?