CUN-bjy / gym-td3-keras
Keras implementation of TD3 (Twin Delayed DDPG) with a PER (Prioritized Experience Replay) option for the OpenAI Gym framework
☆10 Updated 3 years ago
Alternatives and similar repositories for gym-td3-keras:
Users who are interested in gym-td3-keras are comparing it to the libraries listed below.
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆47 Updated 4 years ago
- A simple and fast 2D RL environment with obstacles to learn navigation. ☆18 Updated 5 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer ☆23 Updated 5 years ago
- The implementation of LSTM-TD3. ☆72 Updated last year
- Multi Agent Reinforcement Learning for Dense Path Planning ☆28 Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆83 Updated last year
- 2D Gridworld navigation using RL with Hindsight Experience Replay ☆43 Updated 5 years ago
- Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0 ☆12 Updated 4 years ago
- Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single… ☆49 Updated 5 years ago
- Simple code for reinforcement learning ☆19 Updated 4 years ago
- Multi Agent SAC and DDPG applied to path finding in a 3-dimensional grid ☆12 Updated 3 years ago
- A simple example of how to implement vector-based DDPG for MARL tasks using PyTorch and an ML-Agents environment. ☆36 Updated 6 years ago
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment. ☆30 Updated 3 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic ☆38 Updated 5 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆64 Updated 4 months ago
- Master Thesis - Transfer Learning for Mapless Quadrotor Navigation Using Recurrent Neural Network ☆16 Updated 3 years ago
- Application of an LSTM-based policy gradient on an RL agent ☆14 Updated 2 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆49 Updated 6 years ago
- MBaranPeker / Pursuit-Evasion-Game-with-Deep-Reinforcement-Learning-in-an-environment-with-an-obstacle: In this study, a multi agent chase-escape problem using Deep Q learning. Actors of the problem are smart evader and smart pursuers with o… ☆22 Updated last year
- Experiments with DDPG/MADDPG/DQN/MADDPG+advantage on OpenAI's open-source MPE environment ☆22 Updated 6 years ago
- POMG algorithm for large-scale pursuit game with partial observation and no communication. ☆20 Updated last year
- Multi Agent adaptation of Soft Actor Critic Reinforcement Learning Algorithm ☆17 Updated 6 years ago
- Port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs). ☆17 Updated 6 years ago
- Multi-agent Reinforcement Learning Algorithms (COMA, VDN, QMIX) ☆13 Updated 4 years ago
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where… ☆26 Updated 5 years ago
- Collection of OpenAI parametrized action-space environments. ☆62 Updated last year
- Multi agent PPO implementation in Pytorch for Unity ML Agents environments. ☆25 Updated 5 months ago
- Implementation of DyMA-CL, MARL algorithm ☆26 Updated 4 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆43 Updated 5 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment. ☆34 Updated 3 years ago