cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Reinforcement Learning
☆36Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for deeprl-baselines
- RainBow, Tensorflow☆49Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 2 months ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Updated 5 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆131Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆77Updated 5 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆33Updated 7 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 6 years ago
- Atari-DRQN (keras ver.)☆33Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆44Updated 6 years ago
- ☆69Updated 5 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Updated 6 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Hierarchical Deep RL Network☆30Updated 7 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- ☆35Updated 6 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago