sail-sg / Cheating-LLM-Benchmarks

[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)
72Updated 4 months ago

Alternatives and similar repositories for Cheating-LLM-Benchmarks:

Users that are interested in Cheating-LLM-Benchmarks are comparing it to the libraries listed below