zkailinzhang / Py_AlphagoLinks
Monte Carlo Tree Search (MCTS) ,realize using python
☆12Updated 9 years ago
Alternatives and similar repositories for Py_Alphago
Users that are interested in Py_Alphago are comparing it to the libraries listed below
Sorting:
- Implementing the supervised learning policy networks of AlphaGo☆12Updated 7 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 9 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Updated 8 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- tensorflow deep RL for driving a rover around☆64Updated 8 years ago
- 9x9 AlphaGo☆13Updated 9 years ago
- using CNN to do move prediction and board evaluation for the board game Go☆147Updated 7 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 9 years ago
- ☆39Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆40Updated 2 years ago
- Implementation of q-learning using TensorFlow☆58Updated 8 years ago
- Reinforcement learning with a convolutional neural network.☆35Updated 10 years ago
- Using tensorflow, this agent can autonomously train itself to play Out Run and potentially be modified to play other games or perform tas…☆69Updated 8 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 8 years ago
- DDPG on OpenAI Gym Pendulum☆18Updated 9 years ago
- ☆28Updated 6 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆70Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 7 years ago
- ☆12Updated 4 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Updated 8 years ago
- ☆42Updated 4 years ago
- Deep reinforcement learning. In scikit-learn. In less than 50 effective lines.☆54Updated 8 years ago
- A high level API based on Tensorflow☆30Updated 8 years ago
- Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library☆18Updated 7 years ago
- ☆27Updated 7 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 8 years ago
- ☆53Updated 8 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆11Updated 9 years ago
- Combining deep learning and reinforcement learning.☆81Updated 3 years ago