Accenture / mcp-benchLinks
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers
☆368Updated last month
Alternatives and similar repositories for mcp-bench
Users that are interested in mcp-bench are comparing it to the libraries listed below
Sorting:
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆480Updated this week
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆367Updated 2 months ago
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.☆400Updated last week
- 🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents…☆600Updated this week
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆231Updated this week
- Agentic Web: Weaving the Next Web with AI Agents.☆386Updated last month
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆667Updated 2 weeks ago
- ☆586Updated 3 weeks ago
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets☆744Updated last week
- On the Theoretical Limitations of Embedding-Based Retrieval☆600Updated last month
- ☆436Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆479Updated 2 months ago
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs☆1,859Updated last month
- One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation t…☆351Updated 2 months ago
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha…☆1,273Updated last week
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆268Updated last month
- ☆834Updated 2 months ago
- Implementation of 17+ agentic architectures designed for practical use across different stages of AI system development.☆264Updated last month
- Agents testing framework made easy☆425Updated last week
- OpenCUA: Open Foundations for Computer-Use Agents☆554Updated last month
- [EMNLP 2025] Awesome RAG Reasoning Resources☆344Updated 3 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆712Updated last month
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆607Updated 2 weeks ago
- Agentic RAG, Multi-Agent Systems, and Vision Reasoning are three pipelines to find the perfect LLM☆123Updated 2 months ago
- 🧠 Make your agents learn from experience. Based on the Agentic Context Engineering (ACE) framework.☆555Updated this week
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.☆350Updated 3 weeks ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆652Updated this week
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆490Updated last month
- A completely private and local AI coding assistant, developed by Gensyn. It helps you practice programming problems and train a novel ass…☆663Updated this week
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.☆727Updated 3 weeks ago