eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)
2,377Updated 6 months ago

Alternatives and similar repositories for direct-preference-optimization:

Users that are interested in direct-preference-optimization are comparing it to the libraries listed below