ruizheng20 / gpo

The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".
15Updated 7 months ago

Alternatives and similar repositories for gpo:

Users that are interested in gpo are comparing it to the libraries listed below