THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
2,374 · Updated 3 weeks ago

Alternatives and similar repositories for AgentBench:

Users who are interested in AgentBench are comparing it to the libraries listed below.