Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆49 Updated 3 weeks ago
Alternatives and similar repositories for DDPG_PER:
Users who are interested in DDPG_PER are comparing it to the repositories listed below.
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆106 Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆163 Updated 11 months ago
- Deep recurrent Q-learning on the CartPole-v1 environment ☆87 Updated last year
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆78 Updated 4 years ago
- JAX and Torch Multi-Agent SAC on PettingZoo API ☆72 Updated 4 months ago
- PyTorch implementation of Constrained Policy Optimization ☆53 Updated 3 years ago
- The implementation of LSTM-TD3. ☆76 Updated 2 years ago
- Transplant an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs). ☆19 Updated 7 years ago
- Multi-agent Reinforcement Learning Algorithms (COMA, VDN, QMIX) ☆14 Updated 4 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆47 Updated 5 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment. ☆63 Updated 2 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC ☆46 Updated 5 years ago
- A PyTorch Implementation of Multi Agent Soft Actor Critic ☆39 Updated 6 years ago
- A clean and robust PyTorch implementation of SAC on continuous action space ☆71 Updated 9 months ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆120 Updated 11 months ago
- Code for Weighted QMIX ☆133 Updated 4 years ago
- There will be updates later ☆84 Updated 5 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆53 Updated last year
- A clean and robust PyTorch implementation of SAC on discrete action space ☆35 Updated 5 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space ☆42 Updated 3 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x ☆142 Updated last year
- The official code release for publications in the MARL field from the TJU RL lab. ☆69 Updated 2 years ago
- Constrained Policy Optimization implementation on Safety Gym ☆23 Updated 3 years ago
- Demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention… ☆54 Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning. ☆46 Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆50 Updated 3 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆15 Updated last year
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆50 Updated 6 years ago