monoelh / deep-reinforcement-learning_DDQN_PPO_HER
MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.
☆19Updated 6 years ago
Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER:
Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- ☆18Updated 5 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆45Updated 4 years ago
- ☆35Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆93Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆100Updated 4 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆39Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 6 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆54Updated 2 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆24Updated 9 months ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆83Updated 6 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Updated 6 years ago
- Autoregressive policies for continuous control reinforcement learning☆29Updated 5 years ago
- Hierarchical Self-Play☆21Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 7 years ago