zkailinzhang / Py_Alphago
Monte Carlo Tree Search (MCTS) ,realize using python
☆12Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for Py_Alphago
- Implementing the supervised learning policy networks of AlphaGo☆13Updated 6 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- 9x9 AlphaGo☆13Updated 8 years ago
- Monte Carlo Tree Search implemented in C++☆8Updated 12 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"☆13Updated 6 years ago
- Toolkit designed to ease development of your Deep Neural Network models for the game of Go (weiqi, baduk).☆20Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- ☆39Updated 7 years ago
- ☆53Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 7 years ago
- ☆12Updated 4 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆57Updated 6 months ago
- ☆26Updated 6 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Updated 7 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 7 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆62Updated 7 years ago
- ☆10Updated 8 years ago
- tensorflow deep RL for driving a rover around☆64Updated 7 years ago
- Reinforcement learning with docker and torcs☆20Updated 7 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Updated 8 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- DDPG on OpenAI Gym Pendulum☆19Updated 8 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Updated last year
- A tool to get the arxiv papers☆19Updated 7 years ago
- ☆47Updated 8 years ago