Accenture / mcp-bench
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers
☆333 Updated this week
Alternatives and similar repositories for mcp-bench
Users who are interested in mcp-bench are comparing it to the libraries listed below.
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents ☆440 Updated this week
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol ☆367 Updated last month
- ☆501 Updated last month
- ☆341 Updated 2 weeks ago
- 🚀 MassGen: An Open-source Multi-Agent Scaling System Inspired by Grok Heavy and Gemini Deep Think. Join the discord channel: https://dis… ☆454 Updated this week
- Implementation of 17+ agentic architectures designed for practical use across different stages of AI system development. ☆86 Updated last week
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆267 Updated last month
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆186 Updated 2 weeks ago
- Agentic Web: Weaving the Next Web with AI Agents. ☆363 Updated 2 weeks ago
- On the Theoretical Limitations of Embedding-Based Retrieval ☆567 Updated 2 weeks ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆635 Updated 3 weeks ago
- One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation t… ☆338 Updated 3 weeks ago
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha… ☆1,202 Updated 3 weeks ago
- Codes/Notebooks for AI Projects ☆1,049 Updated this week
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs ☆1,551 Updated 2 weeks ago
- ☆816 Updated 2 weeks ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆609 Updated last week
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE. ☆589 Updated 3 weeks ago
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec… ☆470 Updated last month
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input. ☆1,054 Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆436 Updated last month
- Desktop app and API created in public for multi-agent Claude Code orchestration - coordinate local and remote agents through @mentions. ☆613 Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆409 Updated 3 weeks ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents ☆440 Updated last week
- Agentic RAG, Multi-Agent Systems, and Vision Reasoning are three pipelines to find the perfect LLM ☆116 Updated last month
- gcloud MCP server ☆435 Updated this week
- A curated list of awesome open-source libraries for context engineering (Long-term memory, MCP: Model Context Protocol, Prompt/RAG Compre… ☆91 Updated 3 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆472 Updated 2 months ago
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning ☆405 Updated last week
- ☆202 Updated 2 weeks ago