MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents
☆567 · Mar 6, 2026 · Updated this week
Alternatives and similar repositories for MCP-Universe
Users who are interested in MCP-Universe are comparing it to the repositories listed below.
- MCP-based Agent Deep Evaluation System ☆145 · Sep 26, 2025 · Updated 5 months ago
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers ☆453 · Oct 7, 2025 · Updated 5 months ago
- A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation an… ☆20 · Oct 16, 2025 · Updated 4 months ago
- Llemma formal2formal (tactic prediction) theorem proving experiments ☆20 · Oct 17, 2023 · Updated 2 years ago
- The world’s first science-focused human-AI Agent collaborative discussion community. ☆45 · Updated this week
- One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation t… ☆495 · Sep 8, 2025 · Updated 6 months ago
- Companion code to https://arxiv.org/abs/2402.15491 ☆22 · Sep 18, 2025 · Updated 5 months ago
- [ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution ☆235 · Feb 28, 2026 · Updated last week
- A general memory system for agents, powered by deep-research ☆823 · Feb 28, 2026 · Updated last week
- [CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning ☆1,208 · Sep 12, 2025 · Updated 5 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol ☆373 · Aug 27, 2025 · Updated 6 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking ☆125 · Oct 9, 2025 · Updated 5 months ago
- ☆44 · Sep 19, 2024 · Updated last year
- gcloud MCP server ☆697 · Updated this week
- ☆11 · Nov 3, 2021 · Updated 4 years ago
- ☆15 · Mar 13, 2025 · Updated 11 months ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym" ☆23 · Oct 14, 2025 · Updated 4 months ago
- ☆11 · Jul 21, 2024 · Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks ☆50 · Sep 4, 2025 · Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆4,135 · Nov 13, 2025 · Updated 3 months ago
- On the Theoretical Limitations of Embedding-Based Retrieval ☆634 · Sep 15, 2025 · Updated 5 months ago
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models" ☆11 · May 9, 2023 · Updated 2 years ago
- collab-dev - Collaboration Metrics for Code Reviews ☆23 · May 12, 2025 · Updated 9 months ago
- DSPy Experiments ☆10 · Aug 28, 2025 · Updated 6 months ago
- LLM-based mutation testing ☆14 · Feb 3, 2025 · Updated last year
- AI Agents using Crew AI ☆12 · Jun 16, 2024 · Updated last year
- ☆13 · Dec 13, 2023 · Updated 2 years ago
- AI for Mathematics Paper List ☆17 · Jan 14, 2025 · Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆12 · Dec 5, 2023 · Updated 2 years ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆544 · Sep 8, 2025 · Updated 6 months ago
- An open-source implementation of Whisper ☆479 · Oct 29, 2025 · Updated 4 months ago
- Code and Data for Tau-Bench ☆1,114 · Aug 28, 2025 · Updated 6 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆602 · Aug 21, 2025 · Updated 6 months ago
- A no-install needed web-GUI for Ollama. ☆449 · Feb 6, 2026 · Updated last month
- ☆93 · May 16, 2025 · Updated 9 months ago
- An MCP server to query any Postgres database in natural language. ☆524 · Sep 25, 2025 · Updated 5 months ago
- Automatic Trait Implementation by Induction ☆56 · Updated this week
- LangChain Tutorial ☆13 · Feb 23, 2024 · Updated 2 years ago
- Open source chat based on HuggingChat ☆36 · Updated this week