MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.
☆578Apr 7, 2026Updated last week
Alternatives and similar repositories for MCP-Universe
Users that are interested in MCP-Universe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆471Oct 7, 2025Updated 6 months ago
- MCP-based Agent Deep Evaluation System☆148Apr 11, 2026Updated last week
- DSPy Experiments☆10Aug 28, 2025Updated 7 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆374Aug 27, 2025Updated 7 months ago
- One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation t…☆512Apr 11, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- On the Theoretical Limitations of Embedding-Based Retrieval☆646Sep 15, 2025Updated 7 months ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 7 months ago
- gcloud MCP server☆741Updated this week
- Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.☆113Apr 2, 2026Updated 2 weeks ago
- ☆38Jan 16, 2026Updated 3 months ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- A general memory system for agents, powered by deep-research☆841Mar 14, 2026Updated last month
- Open Source AI Database for Voice Agent Transcripts | Call Analysis & Insights | Extraction | Labelling & Classification☆23Nov 3, 2025Updated 5 months ago
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Oct 17, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Scalable and extensible reinforcement learning for LM agents.☆113Apr 11, 2026Updated last week
- A simple visual test-time scaling method for GUI agent grounding☆23Dec 7, 2025Updated 4 months ago
- Get insights from your research papers with LlamaExtract☆30Aug 8, 2025Updated 8 months ago
- ☆17Apr 30, 2025Updated 11 months ago
- RFIC Inductor Toolkit for ADS, Open Source Version☆69Aug 28, 2025Updated 7 months ago
- Semiont supports human+ai collaborative knowledge work. Use it as: a Wiki, Knowledge Base, Context Graph, Semantic Layer, or Agentic Mem…☆48Updated this week
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 2 years ago
- MCP Atlas☆67Updated this week
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- An open-source implementation of Whisper☆488Oct 29, 2025Updated 5 months ago
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.☆811Oct 18, 2025Updated 6 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆276Updated this week
- A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation an…☆20Oct 16, 2025Updated 6 months ago
- ☆93Oct 30, 2025Updated 5 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆248Nov 3, 2024Updated last year
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha…☆1,366Nov 3, 2025Updated 5 months ago
- ☆17Oct 22, 2024Updated last year
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆558Sep 8, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,494Nov 13, 2025Updated 5 months ago
- LIMI: Less is More for Agency☆161Oct 14, 2025Updated 6 months ago
- [EMNLP 24] Official Implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models☆19Mar 9, 2025Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆615Aug 21, 2025Updated 7 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- OpenCUA: Open Foundations for Computer-Use Agents☆735Feb 4, 2026Updated 2 months ago