AGI-Eval-Official / CATArenaView on GitHub
CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), based on an iterative competitive peer learning framework.
60Dec 25, 2025Updated 2 months ago

Alternatives and similar repositories for CATArena

Users that are interested in CATArena are comparing it to the libraries listed below

Sorting:

Are these results useful?