Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.
☆109Feb 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for DrMAS
Users that are interested in DrMAS are comparing it to the libraries listed below
Sorting:
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆33Nov 11, 2025Updated 3 months ago
- ☆25Aug 19, 2025Updated 6 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆83Jan 16, 2026Updated last month
- ☆64Jan 12, 2026Updated last month
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- ☆34Jan 25, 2026Updated last month
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- ☆45Feb 25, 2026Updated last week
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs☆24Sep 23, 2025Updated 5 months ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆38Feb 19, 2026Updated 2 weeks ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆35Jan 16, 2026Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning☆39Feb 25, 2026Updated last week
- instruction-following benchmark for large reasoning models☆44Aug 9, 2025Updated 6 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆48Dec 25, 2025Updated 2 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆44Feb 10, 2026Updated 3 weeks ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow☆36Oct 3, 2025Updated 5 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆49Updated this week
- ☆65Feb 1, 2026Updated last month
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆69Nov 14, 2024Updated last year
- OmniGAIA: Towards Native Omni-Modal AI Agents☆46Updated this week
- ☆93Dec 30, 2025Updated 2 months ago
- AI model training on heterogeneous, geo-distributed resources☆38Nov 24, 2025Updated 3 months ago
- A Knowledge-grounded framework for Autonomous ML/AI Program Synthesis and Optimization☆78Feb 20, 2026Updated 2 weeks ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆57Nov 5, 2025Updated 4 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆38Jan 23, 2026Updated last month
- On demand communication☆32Feb 26, 2026Updated last week
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- ☆64Jul 14, 2025Updated 7 months ago
- ☆19Mar 10, 2025Updated 11 months ago
- a survey on deep research☆47Sep 9, 2025Updated 5 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Feb 11, 2026Updated 3 weeks ago
- ☆32Oct 13, 2025Updated 4 months ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆98Oct 27, 2025Updated 4 months ago
- ☆21Feb 22, 2026Updated last week