AGI-Eval-Official / CATArena
View external linksLinks

CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), based on an iterative competitive peer learning framework.
59Dec 25, 2025Updated last month

Alternatives and similar repositories for CATArena

Users that are interested in CATArena are comparing it to the libraries listed below

Sorting:

Are these results useful?