junkangwu / beta-DPO

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
29Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for beta-DPO