AI-zone / AlphaGoBang
☆19Updated 7 years ago
Alternatives and similar repositories for AlphaGoBang:
Users that are interested in AlphaGoBang are comparing it to the libraries listed below
- https://2017.icml.cc/Conferences/2017/Schedule☆72Updated 7 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- deep reinforcement learning for personal research☆84Updated 7 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Updated 7 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆115Updated 8 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"☆85Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- (Beta Version!) Experiment Code for Paper ``CoT: Cooperative Training for Generative Modeling of Discrete Data''☆73Updated 5 years ago
- Implementation of a Variational Auto-Encoder in TensorFlow☆209Updated 7 years ago
- 2nd place solution of NIPS2017 LearningToRun Competition.☆125Updated 2 years ago
- This code is written for the blogs☆272Updated 8 years ago
- This is the project for LS-GAN (Loss-Sensitive GAN)☆213Updated 7 years ago
- ☆160Updated 7 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction"☆167Updated 7 years ago
- ☆99Updated 8 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- Apply reinforcement learning to visual attention☆18Updated 8 years ago
- Tensorflow implementation of a Hierarchical and Multiscale RNN, described in https://arxiv.org/abs/1609.01704☆135Updated 7 years ago
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆159Updated 7 years ago
- Value Iteration Networks☆290Updated 7 years ago
- An example of data parallelism and async updates of parameter in tensorflow.☆121Updated 6 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Train an RL agent to play multiple Atari games at once☆71Updated 8 years ago
- Helpful files for Visual Doom AI Competition 2017☆44Updated 6 years ago
- Tensorflow implementation of deep Q networks in paper 'Playing Atari with Deep Reinforcement Learning'☆163Updated 7 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 8 years ago
- Pointer Networks☆103Updated 9 years ago