ruizheng20 / gpo

The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".
14Updated 5 months ago

Related projects

Alternatives and complementary repositories for gpo