AI-zone / AlphaGoBangLinks
☆19Updated 7 years ago
Alternatives and similar repositories for AlphaGoBang
Users that are interested in AlphaGoBang are comparing it to the libraries listed below
Sorting:
- https://2017.icml.cc/Conferences/2017/Schedule☆71Updated 7 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"☆84Updated 8 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆267Updated 5 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- Implementations of deep RL papers and random experimentation☆176Updated 7 years ago
- Deep RL Algorithms implemented for UC Berkeley's CS 294-112: Deep Reinforcement Learning☆140Updated 7 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Updated 7 years ago
- deep reinforcement learning for personal research☆84Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- Code repository for the paper "Hyperparameter Optimization: A Spectral Approach" by Elad Hazan, Adam Klivans, Yang Yuan.☆173Updated 6 years ago
- A implementation of CF-NADE. Yin Zheng, et. al. "A Neural Autoregressive Approach to Collaborative Filtering", accepted by ICML 2016.☆79Updated 7 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆120Updated 8 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Updated 8 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆114Updated 9 years ago
- Our implementation of the Q-learning algorithms by tensorflow or pytorch. @smsxgz @yangwenhaosms @hzxsnczpku☆8Updated 6 years ago
- An example of data parallelism and async updates of parameter in tensorflow.☆121Updated 6 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 9 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 9 years ago
- ☆159Updated 7 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 6 years ago
- auto-tuning momentum SGD optimizer☆288Updated 6 years ago
- InfoGAN: Interpretable Representation Learning☆153Updated 8 years ago
- (Beta Version!) Experiment Code for Paper ``CoT: Cooperative Training for Generative Modeling of Discrete Data''☆72Updated 6 years ago
- Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer☆55Updated 6 years ago
- Imitation Learning Homework 1☆36Updated 7 years ago
- auto-tuning momentum SGD optimizer☆424Updated 7 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆67Updated 8 years ago
- ☆101Updated 8 years ago
- Tools for PyTorch☆222Updated 2 years ago