alexanderbaumann99 / PPO-Algorithms
Experiments with the three PPO variants proposed by John Schulman et al. (PPO, clipped PPO, and PPO with a KL penalty) on the 'CartPole-v1' environment.
☆12 Updated 3 years ago
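For orientation, the three objectives compared in the PPO paper can be sketched in a few lines of plain Python. This is an illustrative sketch, not code from the repository: the function name `ppo_objectives`, its parameters, and the default values of `eps` and `beta` are assumptions, and the KL term uses a simple sample-based estimate rather than the exact per-state KL divergence between the old and new policies.

```python
import math

def ppo_objectives(logp_new, logp_old, advantages, eps=0.2, beta=1.0):
    """Return (unclipped, clipped, kl_penalized) surrogate objectives.

    Hypothetical helper for illustration only. Inputs are per-sample
    log-probabilities of the taken actions under the new and old policies,
    plus advantage estimates. All three values are means over the samples
    and are meant to be maximized.
    """
    n = len(advantages)
    # Probability ratio r = pi_new(a|s) / pi_old(a|s)
    ratios = [math.exp(ln - lo) for ln, lo in zip(logp_new, logp_old)]

    # Surrogate without clipping or penalty: mean(r * A)
    unclipped = sum(r * a for r, a in zip(ratios, advantages)) / n

    # Clipped surrogate: mean(min(r * A, clip(r, 1 - eps, 1 + eps) * A))
    clipped = sum(
        min(r * a, max(1.0 - eps, min(r, 1.0 + eps)) * a)
        for r, a in zip(ratios, advantages)
    ) / n

    # KL-penalized surrogate: mean(r * A) - beta * KL(old || new).
    # Assumption: KL is estimated as mean(logp_old - logp_new) over
    # samples drawn from the old policy (a noisy estimate that can be negative).
    kl_estimate = sum(lo - ln for ln, lo in zip(logp_new, logp_old)) / n
    kl_penalized = unclipped - beta * kl_estimate

    return unclipped, clipped, kl_penalized
```

When the new and old policies agree, every ratio is 1 and all three objectives reduce to the mean advantage; the variants differ only in how they discourage the ratio from drifting away from 1.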
Alternatives and similar repositories for PPO-Algorithms:
Users who are interested in PPO-Algorithms are comparing it to the libraries listed below:
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment. ☆65 Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆65 Updated 7 months ago
- ☆96 Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆89 Updated last year
- Some MARL algorithms implemented in PyTorch ☆66 Updated 3 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆74 Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆166 Updated last year
- Project on multi agent reinforcement learning applied on patrolling agents ☆39 Updated 5 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆44 Updated 7 months ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆56 Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆59 Updated 4 years ago
- The implementation of LSTM-TD3. ☆79 Updated 2 years ago
- This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr… ☆15 Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆61 Updated last year
- ☆42 Updated 3 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning ☆14 Updated 5 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs ☆36 Updated 2 years ago
- Code for Weighted QMIX ☆135 Updated 4 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API ☆77 Updated 5 months ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆48 Updated 5 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm ☆18 Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆50 Updated 2 months ago
- A port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs). ☆21 Updated 4 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment ☆117 Updated 2 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space ☆73 Updated 2 weeks ago
- ☆28 Updated 4 years ago
- There will be updates later ☆84 Updated 5 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic ☆40 Updated 6 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning. ☆48 Updated 2 years ago
- This is the official implementation of Multi-Agent PPO. ☆105 Updated 2 years ago