thomfoster / minRLHFView on GitHub
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
90Nov 23, 2022Updated 3 years ago

Alternatives and similar repositories for minRLHF

Users that are interested in minRLHF are comparing it to the libraries listed below

Sorting:

Are these results useful?