monoelh / deep-reinforcement-learning_DDQN_PPO_HER
MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.
☆20Updated 6 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER:
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- Implementation of OpenAI's Evolution Strategies in PyTorch.☆20Updated 5 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 7 months ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆84Updated 6 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- A simple Gridworld environment for Open AI gym☆25Updated 6 years ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite☆25Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆45Updated 7 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 6 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 5 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- ☆92Updated 4 years ago