Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.
☆125 · Feb 11, 2026 · Updated last month
Alternatives and similar repositories for DrMAS
Users who are interested in DrMAS are comparing it to the libraries listed below.
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models". ☆95 · Feb 6, 2026 · Updated last month
- ☆40 · Feb 20, 2026 · Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning ☆50 · Mar 2, 2026 · Updated 3 weeks ago
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs ☆25 · Sep 23, 2025 · Updated 6 months ago
- ☆64 · Jan 12, 2026 · Updated 2 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆17 · Feb 9, 2026 · Updated last month
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆32 · Oct 30, 2025 · Updated 4 months ago
- ☆66 · Feb 1, 2026 · Updated last month
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆32 · Jan 25, 2026 · Updated 2 months ago
- ☆33 · Oct 13, 2025 · Updated 5 months ago
- instruction-following benchmark for large reasoning models ☆44 · Aug 9, 2025 · Updated 7 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE ☆58 · Nov 5, 2025 · Updated 4 months ago
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow ☆37 · Oct 3, 2025 · Updated 5 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆69 · Nov 14, 2024 · Updated last year
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight ☆88 · Jan 16, 2026 · Updated 2 months ago
- ☆24 · Jan 22, 2025 · Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆86 · May 21, 2025 · Updated 10 months ago
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models. ☆49 · Feb 10, 2026 · Updated last month
- ☆36 · Jan 25, 2026 · Updated 2 months ago
- ☆81 · Mar 11, 2026 · Updated 2 weeks ago
- Project page for the NeurIPS 2024 paper, Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication. ☆17 · Dec 6, 2024 · Updated last year
- ☆42 · Mar 26, 2025 · Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆31 · Oct 9, 2025 · Updated 5 months ago
- ☆65 · Jul 14, 2025 · Updated 8 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval ☆10 · May 30, 2024 · Updated last year
- More reliable Video Understanding Evaluation ☆14 · Sep 23, 2025 · Updated 6 months ago
- Code for GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution Shifts ☆16 · Dec 28, 2024 · Updated last year
- ☆65 · Mar 7, 2026 · Updated 2 weeks ago
- Autonomous visual navigation using the depth images ☆11 · Aug 15, 2019 · Updated 6 years ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆55 · Mar 17, 2026 · Updated last week
- ☆21 · Mar 18, 2026 · Updated last week
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers" ☆98 · Oct 27, 2025 · Updated 4 months ago
- XmodelLM ☆38 · Nov 19, 2024 · Updated last year
- ☆19 · Mar 10, 2025 · Updated last year
- ☆61 · Nov 18, 2024 · Updated last year
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF ☆24 · Oct 8, 2024 · Updated last year
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 · Jul 19, 2024 · Updated last year
- [MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models ☆54 · Oct 20, 2024 · Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆36 · Jan 16, 2026 · Updated 2 months ago