PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
☆526 · Jul 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for off-policy
Users who are interested in off-policy compare it to the libraries listed below.
- This is the official implementation of Multi-Agent PPO (MAPPO). ☆1,892 · Jul 18, 2024 · Updated last year
- Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN. ☆715 · Oct 13, 2022 · Updated 3 years ago
- Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario… ☆1,723 · Sep 8, 2022 · Updated 3 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios) ☆710 · May 18, 2024 · Updated last year
- Lightweight version of MAPPO to help you quickly migrate to your local environment. ☆813 · Oct 23, 2025 · Updated 4 months ago
- Python Multi-Agent Reinforcement Learning framework ☆2,160 · Dec 8, 2022 · Updated 3 years ago
- One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL) ☆1,269 · Nov 28, 2024 · Updated last year
- Official implementation of HARL algorithms based on PyTorch. ☆862 · Apr 27, 2025 · Updated 10 months ago
- ☆485 · Dec 28, 2023 · Updated 2 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support ☆692 · Sep 24, 2024 · Updated last year
- PyTorch implementation of MADDPG (Lowe et al., 2017) ☆679 · Nov 26, 2019 · Updated 6 years ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments" ☆2,729 · Apr 9, 2024 · Updated last year
- ☆223 · Jun 4, 2023 · Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆221 · Apr 17, 2024 · Updated last year
- PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-C… ☆675 · Jul 16, 2022 · Updated 3 years ago
- PyTorch implementations of multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Gr… ☆246 · Oct 23, 2023 · Updated 2 years ago
- A parallel framework for population-based multi-agent reinforcement learning. ☆548 · Dec 14, 2023 · Updated 2 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019 ☆785 · May 29, 2022 · Updated 3 years ago
- Python environments for multi-agent reinforcement learning. ☆741 · May 23, 2022 · Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆173 · Jan 7, 2024 · Updated 2 years ago
- Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments" ☆1,938 · Apr 1, 2024 · Updated last year
- ☆110 · Oct 25, 2021 · Updated 4 years ago
- SMAC: The StarCraft Multi-Agent Challenge ☆1,328 · Feb 18, 2024 · Updated 2 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x ☆169 · Oct 24, 2023 · Updated 2 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments. ☆369 · Mar 16, 2023 · Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆88 · Apr 3, 2023 · Updated 2 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others ☆214 · Jun 6, 2022 · Updated 3 years ago
- A PyTorch implementation of MADDPG (multi-agent deep deterministic policy gradient) ☆690 · Jun 5, 2018 · Updated 7 years ago
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation ☆143 · Jul 8, 2025 · Updated 7 months ago
- An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities ☆3,321 · Feb 6, 2026 · Updated 3 weeks ago
- ☆296 · Feb 15, 2024 · Updated 2 years ago
- Code for MADDPG using PyTorch ☆168 · Oct 5, 2020 · Updated 5 years ago
- A PyTorch implementation of the multi-agent deep deterministic policy gradient (MADDPG) algorithm ☆380 · Apr 8, 2021 · Updated 4 years ago
- This is the official implementation of Multi-Agent PPO. ☆133 · Jan 17, 2023 · Updated 3 years ago
- Code for Weighted QMIX ☆145 · Nov 12, 2020 · Updated 5 years ago
- Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble… ☆58 · Apr 13, 2024 · Updated last year
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021 (spotli… ☆139 · Feb 3, 2021 · Updated 5 years ago
- Multi-Agent Reinforcement Learning (MARL) papers with code ☆416 · Sep 15, 2022 · Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆76 · Jun 9, 2023 · Updated 2 years ago