eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)
2,692 · Updated last year

Alternatives and similar repositories for direct-preference-optimization

Users interested in direct-preference-optimization are comparing it to the libraries listed below.
