MollyZhang / AlphaGoPolicyNet
Implementing the supervised learning policy networks of AlphaGo
☆13Updated 6 years ago
Related projects: ⓘ
- Monte Carlo Tree Search (MCTS) ,realize using python☆12Updated 8 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- Toolkit designed to ease development of your Deep Neural Network models for the game of Go (weiqi, baduk).☆20Updated 7 years ago
- 9x9 AlphaGo☆13Updated 8 years ago
- A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"☆13Updated 6 years ago
- RWA in pytorch☆14Updated 7 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 7 years ago
- Neural Network Models for Multi-label learning☆17Updated 3 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 7 years ago
- Atari gauntlet for RL agents☆29Updated 7 years ago
- Reinforcement learning with a convolutional neural network.☆36Updated 9 years ago
- ☆30Updated 7 years ago
- ☆26Updated 6 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆34Updated 7 years ago
- Go Engine based on Monte-Carlo☆10Updated 8 years ago
- Attention is All You Need in Sonnet☆39Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- A Policy Network in Tensorflow to classify chess moves☆18Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- Tensorflow Implementation of Programmable Agents☆36Updated 6 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆83Updated 8 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18Updated 7 years ago
- ☆14Updated this week
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆18Updated 6 years ago
- An implementation of the AlphaZero algorithm for chess☆34Updated last year
- A PyTorch implementation of alpha-GAN☆15Updated 7 years ago
- Mozi, Transfer Learning, Multi-Modal Learning, Theano☆27Updated 8 years ago
- Convolutional variational autoencoders and text-question, emoji-answer models☆11Updated 7 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆57Updated 4 months ago
- TensorFlow implementation of LAPGAN (WIP, basically just DCGAN for now)☆11Updated 8 years ago