Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.
☆132Apr 1, 2026Updated last month
Alternatives and similar repositories for DrMAS
Users that are interested in DrMAS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆38Nov 11, 2025Updated 6 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆54Mar 2, 2026Updated 2 months ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆112Feb 6, 2026Updated 3 months ago
- ☆46Apr 7, 2026Updated last month
- ☆67Apr 13, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆18Mar 2, 2026Updated 2 months ago
- ☆64Mar 30, 2026Updated last month
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago
- ☆43Feb 26, 2026Updated 3 months ago
- ☆28Aug 19, 2025Updated 9 months ago
- VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments☆23Sep 30, 2025Updated 7 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆35May 15, 2026Updated 2 weeks ago
- ☆34Oct 13, 2025Updated 7 months ago
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆60Nov 5, 2025Updated 6 months ago
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow☆40Oct 3, 2025Updated 7 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"