eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)
2,323Updated 5 months ago

Alternatives and similar repositories for direct-preference-optimization:

Users that are interested in direct-preference-optimization are comparing it to the libraries listed below