zkailinzhang / Py_Alphago
Monte Carlo Tree Search (MCTS), implemented in Python
☆11Updated 9 years ago
Alternatives and similar repositories for Py_Alphago:
Users who are interested in Py_Alphago are comparing it to the repositories listed below
- Implementing the supervised learning policy networks of AlphaGo☆12Updated 7 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- 9x9 AlphaGo☆13Updated 8 years ago
- A series of deep reinforcement learning algorithms, such as DQN, DDQN, one-step-DQN, DDPG, etc.☆42Updated 8 years ago
- DDPG on OpenAI Gym Pendulum☆18Updated 8 years ago
- TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 8 years ago
- Asynchronous One Step Q Learning implemented with MXNet☆20Updated 8 years ago
- Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces☆42Updated 7 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Updated last year
- Distributed TensorFlow implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- Quadrotor simulator mainly intended for training a neural network to control quadrotor flight via the deep Q-learning algorithm☆26Updated 2 years ago
- ☆10Updated 8 years ago
- Reimplementation of the DDPG algorithm using TensorFlow☆38Updated 8 years ago
- Translation system, encoder-decoder model☆9Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆42Updated 6 years ago
- ☆12Updated 4 years ago
- ☆27Updated 7 years ago
- Python code for the book: Machine Learning for Spark☆8Updated 8 years ago
- Code for the experiment using reinforcement learning and LSTM networks to learn binary search.☆6Updated 8 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- Modified TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Updated 8 years ago
- ☆28Updated 5 years ago
- Model-Free Episodic Control☆14Updated 8 years ago
- TensorFlow implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"☆48Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆10Updated 8 years ago
- A tool to fetch arXiv papers☆19Updated 7 years ago
- ☆47Updated 9 years ago
- A reproduction of AlphaGo Zero from "Mastering the game of Go without human knowledge"☆13Updated 7 years ago
- Collection of reinforcement learners implemented in Python, mainly DQN and its variants☆54Updated 7 years ago