DorianKodelja / DeepMind-Atari-Deep-Q-Learner-2Player
☆13Updated 9 years ago
Alternatives and similar repositories for DeepMind-Atari-Deep-Q-Learner-2Player
Users that are interested in DeepMind-Atari-Deep-Q-Learner-2Player are comparing it to the libraries listed below
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 9 years ago
- Direct Future Prediction (DFP) in Keras☆109Updated 7 years ago
- ☆30Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆42Updated 7 years ago
- Implementation of the Model Free Episodic Control paper by DeepMind: http://arxiv.org/abs/1606.04460 ☆55Updated 9 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 9 years ago
- Collection of reinforcement learners implemented in Python, mainly DQN and its variants☆54Updated 8 years ago
- Using a paper from Google DeepMind, I've developed a new version of the DQN using threaded exploration instead of memory replay as explain …☆84Updated 9 years ago
- Keras implementation of DQN on ViZDoom environment☆54Updated 8 years ago
- Distributed TensorFlow implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- ☆96Updated 9 years ago
- An implementation of the paper "Model Free Episodic Control"☆36Updated 5 years ago
- A Quick and Dirty Progressive Neural Network written in TensorFlow.☆51Updated 7 years ago
- TensorFlow implementation of Programmable Agents☆35Updated 7 years ago
- ☆56Updated 2 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- Reinforcement learning algorithm implementations and ML experimentation workspace☆43Updated 6 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆65Updated 8 years ago
- ☆28Updated 6 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 8 years ago
- Deterministic Policy Gradient using torch7☆43Updated 9 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- ☆101Updated 9 years ago
- ☆17Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- ☆38Updated 8 years ago