princeton-nlp / SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
714Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for SimPO