thomasgauthier / LLM-self-play

Minimal implementation of the paper "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" (arXiv:2401.01335)
29 · Mar 1, 2024 · Updated last year

Alternatives and similar repositories for LLM-self-play

Users interested in LLM-self-play are comparing it to the libraries listed below.
