joeljang / RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
118 · Oct 23, 2023 · Updated 2 years ago

Alternatives and similar repositories for RLPHF

Users who are interested in RLPHF are comparing it to the libraries listed below.
