SalesforceAIResearch / MCP-UniverseView external linksLinks
MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents
☆556Updated this week
Alternatives and similar repositories for MCP-Universe
Users that are interested in MCP-Universe are comparing it to the libraries listed below
Sorting:
- MCP-based Agent Deep Evaluation System☆144Sep 26, 2025Updated 4 months ago
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Oct 17, 2023Updated 2 years ago
- A general memory system for agents, powered by deep-research☆809Dec 3, 2025Updated 2 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆119Oct 9, 2025Updated 4 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆373Aug 27, 2025Updated 5 months ago
- FeatureBench: Benchmarking Agentic Coding for Complex Feature Development [ICLR 2026]☆18Updated this week
- ☆42Sep 19, 2024Updated last year
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆21Oct 14, 2025Updated 3 months ago
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- JobMatchAI automates job matching and evaluation processes using AI services (Deepseek & ChatGPT), and Google Sheets integration.☆32Jan 28, 2025Updated last year
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆536Sep 8, 2025Updated 5 months ago
- Minute-long video generation at 24FPS.☆50Feb 2, 2026Updated last week
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 2 years ago
- LLM-based mutation testing☆13Feb 3, 2025Updated last year
- ☆42Jan 19, 2026Updated 3 weeks ago
- ☆12Dec 13, 2023Updated 2 years ago
- AI Agents using Crew AI☆12Jun 16, 2024Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Dec 5, 2023Updated 2 years ago
- DSPy Experiments☆10Aug 28, 2025Updated 5 months ago
- ☆13Apr 23, 2025Updated 9 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆3,975Nov 13, 2025Updated 3 months ago
- ☆27Mar 13, 2024Updated last year
- An MCP server to query any Postgres database in natural language.☆521Sep 25, 2025Updated 4 months ago
- Official implementation of EgoThinker at NIPS 2025☆23Nov 25, 2025Updated 2 months ago
- Data for CyberSOCEval, an LLM benchmark by Meta & CrowdStrike☆18Sep 22, 2025Updated 4 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆37Dec 22, 2025Updated last month
- ☆14Jan 12, 2025Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆24Oct 7, 2025Updated 4 months ago
- A sophisticated CLI assistant to supercharge your Git workflow, powered by multiple AI providers.☆148Aug 20, 2025Updated 5 months ago
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.☆805Oct 18, 2025Updated 3 months ago
- ☆12Jul 17, 2023Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 10 months ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- Quy Nhon AI Hackathon 2022 - Challenge 2: Review Analytics - Top 1 Solution☆10Sep 21, 2022Updated 3 years ago
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆15Jul 11, 2024Updated last year
- Bootstrap project to start your own local AI lab☆16Dec 27, 2025Updated last month
- Get insights from your research papers with LlamaExtract☆30Aug 8, 2025Updated 6 months ago
- ☆17Jul 12, 2025Updated 7 months ago