joeljang / RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
118 · Oct 23, 2023 · Updated 2 years ago

Alternatives and similar repositories for RLPHF

Users who are interested in RLPHF are comparing it to the libraries listed below.
