ehknight / natural-gradient-deep-q-learning
☆22Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for natural-gradient-deep-q-learning
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 6 years ago
- ☆69Updated 6 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- ☆30Updated 7 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆7Updated 7 years ago
- Models built with TensorFlow☆25Updated 5 years ago
- Skip Context Tree Switching - Reference Implementation☆49Updated 7 years ago
- Combining deep learning and reinforcement learning.☆81Updated 3 years ago
- This is the implementation of paper Model Free Episodic Control☆37Updated 5 years ago
- ☆24Updated 9 years ago
- some common TD Learning algorithms☆67Updated 4 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- ☆38Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Reinforcement Learning framework to facilitate development and use of scalable RL algorithms and applications☆62Updated 6 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Updated 7 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Updated 7 years ago
- ☆28Updated 5 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆53Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆56Updated last year
- A Python library for reinforcement learning using Bayesian approaches☆52Updated 9 years ago
- Keras implementation of DQN on ViZDoom environment☆53Updated 8 years ago
- Asynchronous Advantage Actor Critic☆21Updated 8 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 6 years ago