eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)
2,188 · Updated 3 months ago

Related projects

Alternatives and complementary repositories for direct-preference-optimization