marlbenchmark/off-policy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/marlbenchmark/off-policy)

marlbenchmark / off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

☆526

Alternatives and similar repositories for off-policy

Users that are interested in off-policy are comparing it to the libraries listed below

Sorting:

marlbenchmark / on-policy
View on GitHub
This is the official implementation of Multi-Agent PPO (MAPPO).
☆1,892Jul 18, 2024Updated last year
Lizhi-sjtu / MARL-code-pytorch
View on GitHub
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
☆715Oct 13, 2022Updated 3 years ago
starry-sky6688 / MARL-Algorithms
View on GitHub
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario…
☆1,723Sep 8, 2022Updated 3 years ago
hijkzzz / pymarl2
View on GitHub
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆710May 18, 2024Updated last year
tinyzqh / light_mappo
View on GitHub
Lightweight version of MAPPO to help you quickly migrate to your local environment.
☆813Oct 23, 2025Updated 4 months ago
oxwhirl / pymarl
View on GitHub
Python Multi-Agent Reinforcement Learning framework
☆2,160Dec 8, 2022Updated 3 years ago
Replicable-MARL / MARLlib
View on GitHub
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
☆1,269Nov 28, 2024Updated last year
PKU-MARL / HARL
View on GitHub
Official implementation of HARL algorithms based on PyTorch.
☆862Apr 27, 2025Updated 10 months ago
PKU-MARL / Multi-Agent-Transformer
View on GitHub
☆485Dec 28, 2023Updated 2 years ago
uoe-agents / epymarl
View on GitHub
An extension of the PyMARL codebase that includes additional algorithms and environment support
☆692Sep 24, 2024Updated last year
shariqiqbal2810 / maddpg-pytorch
View on GitHub
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
☆679Nov 26, 2019Updated 6 years ago
openai / multiagent-particle-envs
View on GitHub
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
☆2,729Apr 9, 2024Updated last year
cyanrain7 / TRPO-in-MARL
View on GitHub
☆223Jun 4, 2023Updated 2 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
View on GitHub
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆221Apr 17, 2024Updated last year
starry-sky6688 / MADDPG
View on GitHub
Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-C…
☆675Jul 16, 2022Updated 3 years ago
yangchen1997 / Multi-Agent-Reinforcement-Learning
View on GitHub
PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Gr…
☆246Oct 23, 2023Updated 2 years ago
sjtu-marl / malib
View on GitHub
A parallel framework for population-based multi-agent reinforcement learning.
☆548Dec 14, 2023Updated 2 years ago
shariqiqbal2810 / MAAC
View on GitHub
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
☆785May 29, 2022Updated 3 years ago
Bigpig4396 / Multi-Agent-Reinforcement-Learning-Environment
View on GitHub
Hello, I pushed some python environments for Multi Agent Reinforcement Learning.
☆741May 23, 2022Updated 3 years ago
tjuHaoXiaotian / pymarl3
View on GitHub
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…
☆173Jan 7, 2024Updated 2 years ago
openai / maddpg
View on GitHub
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
☆1,938Apr 1, 2024Updated last year
oxwhirl / facmac
View on GitHub
☆110Oct 25, 2021Updated 4 years ago
oxwhirl / smac
View on GitHub
SMAC: The StarCraft Multi-Agent Challenge
☆1,328Feb 18, 2024Updated 2 years ago
JohannesAck / tf2multiagentrl
View on GitHub
Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x
☆169Oct 24, 2023Updated 2 years ago
schroederdewitt / multiagent_mujoco
View on GitHub
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆369Mar 16, 2023Updated 2 years ago
lich14 / CDS
View on GitHub
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆88Apr 3, 2023Updated 2 years ago
thesouther / MARL
View on GitHub
多智能体强化学习（MARL）算法复现，包括QMIX，VDN，QTRAN、MAVEN等等
☆214Jun 6, 2022Updated 3 years ago
xuehy / pytorch-maddpg
View on GitHub
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
☆690Jun 5, 2018Updated 7 years ago
nsidn98 / InforMARL
View on GitHub
Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
☆143Jul 8, 2025Updated 7 months ago
Farama-Foundation / PettingZoo
View on GitHub
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
☆3,321Feb 6, 2026Updated 3 weeks ago
oxwhirl / smacv2
View on GitHub
☆296Feb 15, 2024Updated 2 years ago
DKuan / MADDPG_torch
View on GitHub
The code for maddpg using pytorch
☆168Oct 5, 2020Updated 5 years ago
philtabor / Multi-Agent-Deep-Deterministic-Policy-Gradients
View on GitHub
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
☆380Apr 8, 2021Updated 4 years ago
zoeyuchao / mappo
View on GitHub
This is the official implementation of Multi-Agent PPO.
☆133Jan 17, 2023Updated 3 years ago
oxwhirl / wqmix
View on GitHub
Code for Weighted QMIX
☆145Nov 12, 2020Updated 5 years ago
mttga / pymarl_transformers
View on GitHub
Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble…
☆58Apr 13, 2024Updated last year
Theohhhu / UPDeT
View on GitHub
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…
☆139Feb 3, 2021Updated 5 years ago
TimeBreaker / MARL-papers-with-code
View on GitHub
Multi-Agent Reinforcement Learning (MARL) papers with code
☆416Sep 15, 2022Updated 3 years ago
hijkzzz / noisy-mappo
View on GitHub
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
☆76Jun 9, 2023Updated 2 years ago