Paul33333 / SFT-and-DPO
A detailed code demo showing how to perform full-parameter supervised fine-tuning (SFT) and Direct Preference Optimization (DPO).
18 | Jan 9, 2025 | Updated last year

Alternatives and similar repositories for SFT-and-DPO

Users who are interested in SFT-and-DPO are comparing it to the libraries listed below.
