faildeny / Multi_Agent_PPO
Multi-agent PPO implementation in PyTorch for Unity ML-Agents environments.
☆23 Updated last month
Related projects:
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆56 Updated this week
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆44 Updated 3 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆72 Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆139 Updated 5 months ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆102 Updated 5 months ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆52 Updated 4 years ago
- The implementation of LSTM-TD3. ☆60 Updated last year
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation ☆66 Updated 5 months ago
- PyTorch implementations of MADDPG, MAPPO (coming) ☆63 Updated 6 months ago
- A Reinforcement Learning Project using PPO + LSTM ☆37 Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆41 Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment. ☆54 Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆70 Updated 8 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆87 Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API ☆56 Updated last year
- Transplants an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs). ☆16 Updated 6 years ago
- RLToolkit is a flexible and highly efficient reinforcement learning framework. Includes implementations of DQN, AC, ACER, A2C, A3C, PG, DDPG,… ☆16 Updated 9 months ago
- A clean and robust PyTorch implementation of SAC for continuous action spaces ☆53 Updated 3 months ago
- Transformer in RL for decision-making ☆71 Updated last year
- Uses the multi-agent Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to find reasonable paths for ships ☆43 Updated last year
- This is the official implementation of Multi-Agent PPO. ☆89 Updated last year
- Implementation of MADDPG using PettingZoo and PyTorch ☆102 Updated 10 months ago
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment. ☆28 Updated 3 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment. ☆30 Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆63 Updated last year
- Implementation of MADDPG using PyTorch and multiagent-particle-envs ☆28 Updated 2 years ago