THUDM / AgentBenchView on GitHub
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
3,238Feb 8, 2026Updated last month

Alternatives and similar repositories for AgentBench

Users that are interested in AgentBench are comparing it to the libraries listed below

Sorting:

Are these results useful?