modelscope / MCPBenchLinks
The evaluation benchmark on MCP servers
☆150Updated last month
Alternatives and similar repositories for MCPBench
Users that are interested in MCPBench are comparing it to the libraries listed below
Sorting:
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆155Updated 4 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆209Updated this week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆385Updated last week
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆121Updated 4 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆82Updated last week
- ☆210Updated 2 weeks ago
- ☆682Updated last month
- Build, evaluate and train General Multi-Agent Assistance with ease☆345Updated this week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆150Updated 3 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆212Updated last month
- Efficient Agent Training for Computer Use☆114Updated last month
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆355Updated last week
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆176Updated 2 months ago
- ☆75Updated 10 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆514Updated 3 months ago
- MCP-Zero: Active Tool Discovery for Autonomous LLM Agents☆210Updated 2 weeks ago
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆243Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆226Updated 2 months ago
- AN O1 REPLICATION FOR CODING☆335Updated 7 months ago
- ☆59Updated 2 months ago
- ☆142Updated 2 months ago
- ☆280Updated last month
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆422Updated last month
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆168Updated this week
- Inference code of Lingma SWE-GPT☆231Updated 7 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- ☆230Updated last month
- [Up-to-date] Awesome Agentic Deep Research Resources☆255Updated this week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆185Updated 3 months ago
- ☆274Updated last month