ruizheng20 / gpo

Code for the paper "Toward Optimal LLM Alignments Using Two-Player Games".
16 · Updated 8 months ago

Alternatives and similar repositories for gpo:

Users who are interested in gpo are comparing it to the libraries listed below.