uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
582 · Jan 23, 2025 · Updated last year

Alternatives and similar repositories for SPPO

Users who are interested in SPPO are comparing it to the libraries listed below.
