MollyZhang / AlphaGoPolicyNetLinks

Implementing the supervised learning policy networks of AlphaGo

☆12

Alternatives and similar repositories for AlphaGoPolicyNet

Users that are interested in AlphaGoPolicyNet are comparing it to the libraries listed below

Sorting:

zkailinzhang / Py_Alphago
Monte Carlo Tree Search (MCTS) ,realize using python
☆11Updated 9 years ago
yao62995 / Renju-AI
a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"
☆20Updated 9 years ago
abhishek-kumar / NNForMLL
Neural Network Models for Multi-label learning
☆17Updated 4 years ago
jsikyoon / a3c-distributed_tensorflow
Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning
☆29Updated 7 years ago
Avmb / lowrank-highwaynetwork
Low-rank Highway Networks
☆13Updated 9 years ago
Islandman93 / reinforcepy
Collection of reinforcement learners implemented in python. Mainly including DQN and its variants
☆54Updated 8 years ago
domarps / papers-i-read
Summaries and notes on recent Deep Learning literature
☆10Updated 5 years ago
woodrush / vgg-visualizer-tf
VGG feature visualizer in TensorFlow
☆10Updated 9 years ago
talolard / DenseContinuousSentances
An aspiring attempt to generate a continuous space of sentences with DenseNet
☆26Updated 8 years ago
jjkke88 / RL_toolbox
reinfore learning tool box, contains trpo, a3c algorithm for continous action space
☆42Updated 7 years ago
FrownyFace / DNC
Differentiable neural computers
☆27Updated 8 years ago
dbobrenko / async-deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
☆42Updated 7 years ago
jimfleming / LAPGAN
TensorFlow implementation of LAPGAN (WIP, basically just DCGAN for now)
☆11Updated 9 years ago
dirkweissenborn / dual_am_rnn
Tensorflow Implementation of the (Dual)-Associative Memory GRUs
☆18Updated 9 years ago
paulbertens / rank-ordered-autoencoder
Rank Ordered Autoencoder implementation as described in https://arxiv.org/abs/1605.01749
☆34Updated 9 years ago
PFCM / neural-episodic-control
☆30Updated 8 years ago
awbrown90 / DeepReinforcementLearning
☆26Updated 7 years ago
coxlab / tsnet
Tensor Switching Networks
☆12Updated 7 years ago
cod3licious / simec-theano
SimEc code relying on the theano library - check out the simec repo instead for keras based code!
☆10Updated 7 years ago
nguyenkh / NeuralDenoising
Neural-based Noise Filtering from Word Embeddings
☆11Updated 8 years ago
yao62995 / Deep_Reinforcement_Learning
Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc
☆43Updated 8 years ago
deontologician / atari_multitask
Atari gauntlet for RL agents
☆29Updated 8 years ago
bzcheeseman / pytorch-rwa
RWA in pytorch
☆14Updated 8 years ago
carpedm20 / RCMN
Recurrent Convolutional Memory Network (in progress)
☆28Updated 9 years ago
yandexdataschool / sklearn-deeprl
Deep reinforcement learning. In scikit-learn. In less than 50 effective lines.
☆54Updated 8 years ago
jmoudrik / deep-go-wrap
Toolkit designed to ease development of your Deep Neural Network models for the game of Go (weiqi, baduk).
☆21Updated 8 years ago
cavaunpeu / neurally-embedded-emojis
Convolutional variational autoencoders and text-question, emoji-answer models
☆11Updated 8 years ago
hycis / transfer_learning
Mozi, Transfer Learning, Multi-Modal Learning, Theano
☆27Updated 9 years ago
sanghyunyi / alphago_zero
A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"
☆13Updated 7 years ago
louishenrifranc / attention
Attention is All You Need in Sonnet
☆38Updated 7 years ago