joeljang / RLPHF

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
98Updated last year

Alternatives and similar repositories for RLPHF:

Users that are interested in RLPHF are comparing it to the libraries listed below