amarack / python-rl
Some Reinforcement Learning in Python
☆117Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for python-rl
- ☆98Updated 8 years ago
- ☆99Updated 8 years ago
- An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow☆74Updated 7 years ago
- some common TD Learning algorithms☆67Updated 4 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- An implementation of Deep Q-Network using Caffe☆68Updated 9 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆38Updated 7 years ago
- ☆161Updated 7 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Updated 6 years ago
- Train an RL agent to play multiple Atari games at once☆71Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Updated 8 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- RLPy Reinforcement Learning Framework☆251Updated 5 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆153Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 6 years ago
- A reinforcement learning framework☆154Updated 5 years ago
- implement of prioritized experience replay☆156Updated 6 years ago
- Value Iteration Networks☆288Updated 7 years ago
- Reinforcement learning environments for Torch7☆93Updated 7 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆115Updated 8 years ago
- ☆78Updated 6 years ago
- Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017☆150Updated 2 months ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆121Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago