alecwangcq / f-divergence-dpo

Direct preference optimization with f-divergences.
13 · Updated 2 months ago

Alternatives and similar repositories for f-divergence-dpo:

Users who are interested in f-divergence-dpo are comparing it to the libraries listed below.