thomasgauthier / LLM-self-play

Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
29Updated 11 months ago

Alternatives and similar repositories for LLM-self-play:

Users that are interested in LLM-self-play are comparing it to the libraries listed below