THUDM / AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆2,374 · Updated 3 weeks ago
Alternatives and similar repositories for AgentBench:
Users who are interested in AgentBench are comparing it to the libraries listed below:
- AgentTuning: Enabling Generalized Agent Abilities for LLMs