thomfoster / minRLHF

A (somewhat) minimal library for finetuning language models with PPO on human feedback.
86Updated 2 years ago

Alternatives and similar repositories for minRLHF:

Users that are interested in minRLHF are comparing it to the libraries listed below