modelscope / MCPBenchLinks
The evaluation benchmark on MCP servers
☆213Updated last month
Alternatives and similar repositories for MCPBench
Users that are interested in MCPBench are comparing it to the libraries listed below
Sorting:
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆408Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆266Updated last month
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆128Updated 6 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆355Updated last month
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆172Updated 7 months ago
- ☆815Updated last month
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆442Updated 2 weeks ago
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆412Updated last week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆712Updated 2 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources☆490Updated last month
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆216Updated 3 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆447Updated last month
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆160Updated 6 months ago
- AWM: Agent Workflow Memory☆328Updated 8 months ago
- MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…☆698Updated this week
- Data Synthesis for Deep Research Based on Semi-Structured Data☆165Updated 2 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆161Updated 4 months ago
- ☆293Updated 4 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆169Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆245Updated 5 months ago
- ☆89Updated 4 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆621Updated 5 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆549Updated 5 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆187Updated last month
- SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasonin…☆173Updated 2 weeks ago
- ☆78Updated last year
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆163Updated last month
- Efficient Agent Training for Computer Use☆131Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆136Updated 7 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆88Updated 2 months ago