HumanSignal / RLHF

A collection of links, tutorials, and best practices for collecting data and building an end-to-end RLHF system to fine-tune generative AI models.
214 · Updated last year

Alternatives and similar repositories for RLHF:

Users who are interested in RLHF are comparing it to the libraries listed below.