sail-sg / Cheating-LLM-BenchmarksView on GitHub
[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)
84Oct 23, 2024Updated last year

Alternatives and similar repositories for Cheating-LLM-Benchmarks

Users that are interested in Cheating-LLM-Benchmarks are comparing it to the libraries listed below

Sorting:

Are these results useful?