zkailinzhang / Py_Alphago
Monte Carlo Tree Search (MCTS), implemented in Python
☆12Updated 8 years ago
Related projects
Alternatives and complementary repositories for Py_Alphago
- Implementing the supervised learning policy networks of AlphaGo☆13Updated 6 years ago
- A Renju game replicating the paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- A series of deep reinforcement learning algorithms, such as DQN, DDQN, one-step DQN, DDPG, etc.☆41Updated 8 years ago
- Monte Carlo Tree Search implemented in C++☆8Updated 12 years ago
- 9x9 AlphaGo☆13Updated 8 years ago
- Reinforcement learning toolbox; contains TRPO and A3C algorithms for continuous action spaces☆43Updated 6 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 7 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- A reproduction of AlphaGo Zero from "Mastering the game of Go without human knowledge"☆13Updated 6 years ago
- ☆47Updated 8 years ago
- A high level API based on Tensorflow☆30Updated 8 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Updated 7 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Updated 3 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- tensorflow deep RL for driving a rover around☆64Updated 7 years ago
- ☆53Updated 7 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆56Updated 6 months ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 8 years ago
- Asynchronous Advantage Actor Critic☆21Updated 8 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 7 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- ☆12Updated 4 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆14Updated 8 years ago
- ☆16Updated 7 years ago
- ☆28Updated 5 years ago
- DDPG on OpenAI Gym Pendulum☆19Updated 8 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45Updated 7 years ago