uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
586 · Jan 23, 2025 · Updated last year

Alternatives and similar repositories for SPPO

Users that are interested in SPPO are comparing it to the libraries listed below.

