sail-sg / Cheating-LLM-Benchmarks

[SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
68Updated 3 months ago

Alternatives and similar repositories for Cheating-LLM-Benchmarks:

Users that are interested in Cheating-LLM-Benchmarks are comparing it to the libraries listed below