THUDM / AgentBench
View external linksLinks

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
3,162Feb 8, 2026Updated last week

Alternatives and similar repositories for AgentBench

Users that are interested in AgentBench are comparing it to the libraries listed below

Sorting:

Are these results useful?