Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.
☆121 · Apr 1, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for DrMAS
Users that are interested in DrMAS are comparing it to the libraries listed below.
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models". ☆104 · Feb 6, 2026 · Updated 2 months ago
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs ☆26 · Apr 8, 2026 · Updated last week
- ☆67 · Updated this week
- ☆18 · Mar 2, 2026 · Updated last month
- ☆64 · Mar 30, 2026 · Updated 2 weeks ago
- Method for Long Context RLMs using verifiable Lambda Calculus ☆134 · Apr 1, 2026 · Updated 2 weeks ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆33 · Oct 30, 2025 · Updated 5 months ago
- ☆66 · Feb 1, 2026 · Updated 2 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆32 · Jan 25, 2026 · Updated 2 months ago
- ☆33 · Oct 13, 2025 · Updated 6 months ago
- instruction-following benchmark for large reasoning models ☆44 · Aug 9, 2025 · Updated 8 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE ☆59 · Nov 5, 2025 · Updated 5 months ago
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow ☆40 · Oct 3, 2025 · Updated 6 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning ☆183 · Mar 27, 2026 · Updated 2 weeks ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆71 · Nov 14, 2024 · Updated last year
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images ☆68 · Jan 23, 2026 · Updated 2 months ago
- ☆24 · Jan 22, 2025 · Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆86 · May 21, 2025 · Updated 10 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ☆63 · May 22, 2025 · Updated 10 months ago
- [RAL 2023] transformer + reinforcement learning for navigation + POMDP ☆15 · Jul 19, 2023 · Updated 2 years ago
- ☆38 · Jan 25, 2026 · Updated 2 months ago
- ☆42 · Mar 26, 2025 · Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆31 · Oct 9, 2025 · Updated 6 months ago
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models. ☆51 · Feb 10, 2026 · Updated 2 months ago
- ☆65 · Jul 14, 2025 · Updated 9 months ago
- More reliable Video Understanding Evaluation ☆15 · Sep 23, 2025 · Updated 6 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval ☆10 · May 30, 2024 · Updated last year
- Autonomous visual navigation using the depth images ☆11 · Aug 15, 2019 · Updated 6 years ago
- ☆20 · Mar 18, 2026 · Updated 3 weeks ago
- Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation" ☆51 · Updated this week
- XmodelLM ☆38 · Nov 19, 2024 · Updated last year
- ☆19 · Mar 10, 2025 · Updated last year
- ☆44 · Jan 6, 2026 · Updated 3 months ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers" ☆99 · Oct 27, 2025 · Updated 5 months ago
- Combined InstantID🔥 and FouriScale to generate high resolution image! ☆11 · Apr 3, 2024 · Updated 2 years ago
- ☆69 · Mar 7, 2026 · Updated last month
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization" ☆182 · Apr 7, 2026 · Updated last week
- ☆51 · Feb 25, 2026 · Updated last month
- ☆61 · Nov 18, 2024 · Updated last year