joeljang / RLPHF

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
98 · Updated last year

Related projects

Alternatives and complementary repositories for RLPHF