This is the official implementation of Multi-Agent PPO (MAPPO).
☆1,902Jul 18, 2024Updated last year
Alternatives and similar repositories for on-policy
Users that are interested in on-policy are comparing it to the libraries listed below
Sorting:
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆526Jul 21, 2023Updated 2 years ago
- Lightweight version of MAPPO to help you quickly migrate to your local environment.☆813Oct 23, 2025Updated 4 months ago
- Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario…☆1,723Sep 8, 2022Updated 3 years ago
- Python Multi-Agent Reinforcement Learning framework☆2,163Dec 8, 2022Updated 3 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support☆692Sep 24, 2024Updated last year
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆2,733Apr 9, 2024Updated last year
- Official implementation of HARL algorithms based on PyTorch.☆865Apr 27, 2025Updated 10 months ago
- SMAC: The StarCraft Multi-Agent Challenge☆1,328Feb 18, 2024Updated 2 years ago
- ☆485Dec 28, 2023Updated 2 years ago
- One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)☆1,276Nov 28, 2024Updated last year
- Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.☆717Oct 13, 2022Updated 3 years ago
- ☆223Jun 4, 2023Updated 2 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆712May 18, 2024Updated last year
- Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆1,940Apr 1, 2024Updated last year
- A parallel framework for population-based multi-agent reinforcement learning.☆548Dec 14, 2023Updated 2 years ago
- PyTorch Implementation of MADDPG (Lowe et. al. 2017)☆679Nov 26, 2019Updated 6 years ago
- An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities☆3,321Feb 6, 2026Updated 3 weeks ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆370Mar 16, 2023Updated 2 years ago
- ☆296Feb 15, 2024Updated 2 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆788May 29, 2022Updated 3 years ago
- Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-C…☆675Jul 16, 2022Updated 3 years ago
- Paper list of multi-agent reinforcement learning (MARL)☆4,729Feb 11, 2026Updated 3 weeks ago
- Hello, I pushed some python environments for Multi Agent Reinforcement Learning.☆741May 23, 2022Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆76Jun 9, 2023Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆222Apr 17, 2024Updated last year
- ☆110Oct 25, 2021Updated 4 years ago
- This is the official implementation of Multi-Agent PPO.☆134Jan 17, 2023Updated 3 years ago
- Massively Parallel Deep Reinforcement Learning. 🔥☆4,295Feb 20, 2026Updated last week
- BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL alg…☆574Feb 7, 2026Updated 3 weeks ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆735Apr 12, 2023Updated 2 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆168Dec 8, 2022Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆173Jan 7, 2024Updated 2 years ago
- Code for Weighted QMIX☆145Nov 12, 2020Updated 5 years ago
- A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)☆690Jun 5, 2018Updated 7 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆139Feb 3, 2021Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- An elegant PyTorch deep reinforcement learning library.☆10,305Dec 1, 2025Updated 3 months ago
- Scalable Multi-Agent RL Training School for Autonomous Driving☆1,109Jan 31, 2025Updated last year
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆83Dec 17, 2024Updated last year