lmarena / arena-hard-autoView on GitHub
Arena-Hard-Auto: An automatic LLM benchmark.
1,006Jun 21, 2025Updated 8 months ago

Alternatives and similar repositories for arena-hard-auto

Users that are interested in arena-hard-auto are comparing it to the libraries listed below

Sorting:

Are these results useful?