MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.
☆575 · Mar 25, 2026 · Updated this week
Alternatives and similar repositories for MCP-Universe
Users that are interested in MCP-Universe are comparing it to the libraries listed below.
- MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use. ☆400 · Jan 27, 2026 · Updated 2 months ago
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers ☆464 · Oct 7, 2025 · Updated 5 months ago
- MCP-based Agent Deep Evaluation System ☆148 · Sep 26, 2025 · Updated 6 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol ☆373 · Aug 27, 2025 · Updated 7 months ago
- One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation t… ☆501 · Sep 8, 2025 · Updated 6 months ago
- On the Theoretical Limitations of Embedding-Based Retrieval ☆639 · Sep 15, 2025 · Updated 6 months ago
- gcloud MCP server ☆718 · Updated this week
- [CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning ☆1,216 · Sep 12, 2025 · Updated 6 months ago
- [ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution ☆296 · Mar 19, 2026 · Updated last week
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆13 · Dec 5, 2023 · Updated 2 years ago
- A general memory system for agents, powered by deep-research ☆833 · Mar 14, 2026 · Updated 2 weeks ago
- Llemma formal2formal (tactic prediction) theorem proving experiments ☆20 · Oct 17, 2023 · Updated 2 years ago
- Scalable and extensible reinforcement learning for LM agents. ☆112 · Mar 12, 2026 · Updated 2 weeks ago
- Semiont supports human+ai collaborative knowledge work. Use it as: a Semantic Layer, Context Graph, Knowledge Base, Wiki, Annotator, Res… ☆37 · Updated this week
- A simple visual test-time scaling method for GUI agent grounding ☆21 · Dec 7, 2025 · Updated 3 months ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding? ☆12 · Dec 3, 2024 · Updated last year
- ☆17 · Apr 30, 2025 · Updated 10 months ago
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models" ☆11 · May 9, 2023 · Updated 2 years ago
- MCP Atlas ☆53 · Mar 18, 2026 · Updated last week
- Reproducing R1 for Code with Reliable Rewards ☆12 · Apr 9, 2025 · Updated 11 months ago
- An open-source implementation of Whisper ☆486 · Oct 29, 2025 · Updated 5 months ago
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE. ☆811 · Oct 18, 2025 · Updated 5 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆276 · Oct 14, 2025 · Updated 5 months ago
- ☆93 · Oct 30, 2025 · Updated 5 months ago
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024 ☆16 · Jul 11, 2024 · Updated last year
- ☆16 · Jul 17, 2022 · Updated 3 years ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆247 · Nov 3, 2024 · Updated last year
- ☆17 · Oct 22, 2024 · Updated last year
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆552 · Sep 8, 2025 · Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆4,309 · Nov 13, 2025 · Updated 4 months ago
- A starter kit for evaluating benchmarks on the 🤗 Hub ☆16 · Dec 29, 2023 · Updated 2 years ago
- OpenCUA: Open Foundations for Computer-Use Agents ☆722 · Feb 4, 2026 · Updated last month
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆609 · Aug 21, 2025 · Updated 7 months ago
- [EMNLP 24] Official Implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models ☆19 · Mar 9, 2025 · Updated last year
- LIMI: Less is More for Agency ☆161 · Oct 14, 2025 · Updated 5 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 · Oct 7, 2025 · Updated 5 months ago
- The code and data for the paper JiuZhang3.0 ☆49 · May 26, 2024 · Updated last year
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models" ☆25 · Jan 3, 2026 · Updated 2 months ago
- AI Agent Builder and Runtime by Docker Engineering ☆2,719 · Updated this week