icip-cas / LiveMCPBench

LiveMCPBench is a benchmark for evaluating the ability of agents to navigate and utilize a large-scale MCP toolset. It provides a comprehensive set of tasks that challenge agents to effectively use various tools in daily scenarios.
92 · Dec 18, 2025 · Updated last month

Alternatives and similar repositories for LiveMCPBench

Users who are interested in LiveMCPBench are comparing it to the libraries listed below.
