musyoku / double-dqn
Chainer implementation of Double Deep Q-Network (Double DQN)
☆27Updated 8 years ago
Related projects: ⓘ
- Keras implementation of DQN on ViZDoom environment☆53Updated 7 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Updated 7 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆83Updated 8 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆91Updated 7 years ago
- Deep Attention Recurrent Q-Network☆114Updated 8 years ago
- An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow☆74Updated 7 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 7 years ago
- An implementation of Deep Q-Network using Caffe☆67Updated 8 years ago
- Reinforcement learning environments for Torch7☆93Updated 7 years ago
- Unfinished. Deep Q Learning in Tensorflow for ATARI.☆85Updated 8 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆121Updated 7 years ago
- ☆117Updated this week
- Deterministic Policy Gradient using torch7☆44Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 6 years ago
- Universal library for deep reinforcement learning.☆39Updated 8 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- ☆99Updated 8 years ago
- ☆33Updated this week
- ☆68Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- ☆98Updated 8 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆53Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- ☆28Updated 5 years ago
- This is the implementation of paper Model Free Episodic Control☆37Updated 4 years ago
- ☆78Updated 6 years ago
- ☆38Updated 7 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- Helpful files for Visual Doom AI Competition 2017☆44Updated 6 years ago