yoshinobc / M3DDPG-pytorch
MiniMax Multi-Agent Deep Deterministic Policy Gradient (M3DDPG) pytorch implementation
☆10Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for M3DDPG-pytorch
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆39Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆75Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆50Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆60Updated last year
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆26Updated last year
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆42Updated 4 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆13Updated 4 years ago
- Codes for paper of 'Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning'☆35Updated last year
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆37Updated 2 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆22Updated 6 years ago
- Source code of paper "A Hierarchical Reinforcement Learning Algorithm Based on Attention Mechanism for UAV Autonomous Navigation"☆10Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆86Updated 8 months ago
- D3QN Pytorch☆53Updated 2 years ago
- my code for paper Parameterized-DQN☆21Updated 3 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆40Updated 4 years ago
- Use Multi-Agent Deep Deterministic Policy Gradient(DDPG) algorithm to find reasonable paths for ships☆28Updated 2 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆32Updated 3 weeks ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆16Updated 6 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆110Updated last year
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆57Updated 2 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆49Updated last year
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆179Updated 2 years ago
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment.☆29Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆133Updated last year
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆111Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆110Updated 7 months ago
- ☆90Updated 3 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆28Updated 2 years ago