alexanderbaumann99 / PPO-Algorithms
Experiments with the three PPO algorithms (PPO, clipped PPO, PPO with KL penalty) proposed by John Schulman et al. on the 'CartPole-v1' environment.
☆12 · Updated 3 years ago
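For orientation, the three variants named above differ mainly in the per-sample objective they maximize. The following is a minimal sketch, not code from this repository. It assumes that "PPO" refers to the plain (unclipped) probability-ratio surrogate. The function name `surrogate_terms`, the defaults `clip_eps=0.2` and `kl_beta=1.0`, and the single-sample KL estimate are illustrative assumptions.

```python
import math


def surrogate_terms(logp_new, logp_old, advantage, clip_eps=0.2, kl_beta=1.0):
    """Per-sample PPO objectives (to be maximized) for a single action.

    Hypothetical helper: logp_new / logp_old are log-probabilities of the
    taken action under the current and the data-collecting policy.
    """
    # Probability ratio pi_new(a|s) / pi_old(a|s)
    ratio = math.exp(logp_new - logp_old)

    # Variant 1: plain surrogate, no clipping and no penalty
    unclipped = ratio * advantage

    # Variant 2: clipped surrogate, takes the pessimistic (smaller) value
    clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
    clipped = min(unclipped, clipped_ratio * advantage)

    # Variant 3: KL-penalized surrogate. A single-sample estimate of
    # KL(pi_old || pi_new) stands in for the full KL here (assumption);
    # it only approximates the KL when averaged over many samples.
    kl_estimate = logp_old - logp_new
    kl_penalized = unclipped - kl_beta * kl_estimate

    return {"unclipped": unclipped, "clipped": clipped, "kl_penalized": kl_penalized}
```

In practice these terms would be averaged over a batch and negated to form a loss, and the KL-penalty variant usually adapts `kl_beta` between updates. Both steps are left out of this sketch.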
Related projects
Alternatives and complementary repositories for PPO-Algorithms
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆58 · Updated 2 months ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment. ☆57 · Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API ☆61 · Updated last year
- The official code release for publications in the MARL field by the TJU RL lab. ☆57 · Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆45 · Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆146 · Updated 7 months ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆52 · Updated 4 years ago
- Some MARL algorithms implemented in PyTorch ☆64 · Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment ☆111 · Updated last year
- implementation of MADDPG using PettingZoo and PyTorch ☆112 · Updated last year
- Port an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs). ☆16 · Updated 6 years ago
- Implementation for mSAC methods in PyTorch ☆37 · Updated 3 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs ☆28 · Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆54 · Updated last year
- Implementation of DyMA-CL, MARL algorithm ☆26 · Updated 4 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC ☆42 · Updated 4 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆40 · Updated 2 months ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆40 · Updated 4 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆110 · Updated 7 months ago
- ☆90 · Updated 3 years ago
- This is the official implementation of Multi-Agent PPO. ☆93 · Updated last year
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆75 · Updated 3 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning ☆12 · Updated 5 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf ☆31 · Updated last year
- Deep recurrent Q learning on CartPole-v1 environment ☆74 · Updated 10 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space ☆60 · Updated 5 months ago
- an implementation of ATOC ☆14 · Updated 2 years ago
- Code accompanying paper "Coordinated Proximal Policy Optimization" ☆11 · Updated 2 years ago
- This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr… ☆13 · Updated last year
- Project on multi agent reinforcement learning applied on patrolling agents ☆38 · Updated 4 years ago