allenai / WildBenchLinks

Benchmarking LLMs with Challenging Tasks from Real Users
244Updated last year

Alternatives and similar repositories for WildBench

Users that are interested in WildBench are comparing it to the libraries listed below

Sorting: