THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
2,326 · Updated 2 months ago

Alternatives and similar repositories for AgentBench:

Users who are interested in AgentBench are comparing it to the libraries listed below.