Vance0124 / Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)
126Updated 6 months ago

Alternatives and similar repositories for Token-level-Direct-Preference-Optimization:

Users that are interested in Token-level-Direct-Preference-Optimization are comparing it to the libraries listed below