watts4speed / fast-dqn-caffe
Optimized dqn for caffe
☆11Updated 8 years ago
Related projects: ⓘ
- Deterministic Policy Gradient using torch7☆44Updated 8 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Updated 7 years ago
- ☆38Updated 7 years ago
- A Quick and Dirty Progressive Neural Network written in TensorFlow.☆53Updated 6 years ago
- Reinforcement learning environments for Torch7☆93Updated 7 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆53Updated 7 years ago
- Deep Q-learning with Caffe on Space Invaders☆19Updated 9 years ago
- Universal library for deep reinforcement learning.☆39Updated 8 years ago
- Unsupervised learning of visual concepts from video☆57Updated 8 years ago
- ☆30Updated this week
- Deep Attention Recurrent Q-Network☆114Updated 8 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 7 years ago
- Cluttered MNIST Dataset☆50Updated 9 years ago
- ☆28Updated 5 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- Keras implementation of DQN on ViZDoom environment☆53Updated 7 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- ☆70Updated this week
- A lua wrapper for the Arcade Learning Environment/xitari.☆34Updated 8 years ago
- ☆31Updated this week
- Train an RL agent to play multiple Atari games at once☆71Updated 8 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆115Updated 8 years ago
- Implementation of a simple example of Q learning in Torch.☆50Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆18Updated 7 years ago
- Model Zoo for Deep Reinforcement Learning☆14Updated 8 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆65Updated 7 years ago
- Code for the Torch in Action book☆45Updated 7 years ago