zkailinzhang / Py_Alphago
Monte Carlo Tree Search (MCTS), implemented in Python
☆12Updated 9 years ago
Alternatives and similar repositories for Py_Alphago
Users that are interested in Py_Alphago are comparing it to the libraries listed below
- Implementing the supervised learning policy networks of AlphaGo☆12Updated 7 years ago
- A series of deep reinforcement learning algorithms, such as DQN, DDQN, one-step DQN, DDPG, etc.☆43Updated 9 years ago
- 9x9 AlphaGo☆13Updated 9 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- tensorflow deep RL for driving a rover around☆64Updated 8 years ago
- Implementation of q-learning using TensorFlow☆58Updated 8 years ago
- using CNN to do move prediction and board evaluation for the board game Go☆147Updated 7 years ago
- Reinforcement learning with a convolutional neural network.☆35Updated 10 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 9 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆14Updated 9 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 8 years ago
- Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces☆42Updated 7 years ago
- ☆39Updated 8 years ago
- ☆12Updated 5 years ago
- DDPG on OpenAI Gym Pendulum☆18Updated 9 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Updated 9 years ago
- Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library☆18Updated 7 years ago
- deep reinforcement learning for personal research☆85Updated 8 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 8 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆70Updated 8 years ago
- Neuroevolution as a direct policy search deep reinforcement learning method, implemented using Keras and DEAP.☆71Updated 4 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 9 years ago
- Combining deep learning and reinforcement learning.☆81Updated 4 years ago
- Toolkit designed to ease development of your Deep Neural Network models for the game of Go (weiqi, baduk).☆20Updated 8 years ago
- A powerful machine learning algorithm utilizing Q-Learning and Neural Networks, implemented using Torch and Lua.☆26Updated 10 years ago
- ☆27Updated 7 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 7 years ago
- Implements basic reinforcement learning algorithms to control a remote-control car in a room.☆45Updated 9 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 8 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 8 years ago