eric-mitchell / direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
2,855 · Aug 11, 2024 · Updated last year

Alternatives and similar repositories for direct-preference-optimization

Users who are interested in direct-preference-optimization are comparing it to the libraries listed below.
