OpenGenerativeAI / llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
1,416Updated last month

Alternatives and similar repositories for llm-colosseum:

Users that are interested in llm-colosseum are comparing it to the libraries listed below