OpenGenerativeAI / llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
β˜†1,332Updated this week

Related projects β“˜

Alternatives and complementary repositories for llm-colosseum