alecwangcq / f-divergence-dpo

Direct preference optimization with f-divergences.
13Updated 4 months ago

Alternatives and similar repositories for f-divergence-dpo:

Users that are interested in f-divergence-dpo are comparing it to the libraries listed below