alecwangcq / f-divergence-dpo

Direct preference optimization with f-divergences.
12Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for f-divergence-dpo