fbora / tic-tac-GO_ZERO
Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe
☆16Updated 7 years ago
Alternatives and similar repositories for tic-tac-GO_ZERO:
Users that are interested in tic-tac-GO_ZERO are comparing it to the libraries listed below
- A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"☆13Updated 7 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆56Updated 9 months ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- Chainer implementation of Double Deep Q-Network (Double DQN)☆27Updated 8 years ago
- Keras implementation of DQN on ViZDoom environment☆53Updated 8 years ago
- ☆18Updated 5 years ago
- This is a tutorial written for Caffe2 which mocks google AlphaGo Fan and AlphaGo Zero.☆8Updated 6 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- ☆57Updated 2 years ago
- Code to recreate AlphaGo Zero models☆19Updated last year
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 8 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- A python client library for microRTS.☆19Updated 5 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 6 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆83Updated 8 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Updated last year
- Reinforcement learning in 3D.☆21Updated 7 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆66Updated 7 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 7 years ago
- in progress☆60Updated 9 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆30Updated 7 years ago
- ☆13Updated 9 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆159Updated 5 years ago
- 🤖 Implements of Reinforcement Learning algorithms.☆115Updated 6 years ago
- Keras implementation of Curiosity-driven Exploration by Self-supervised Prediction☆8Updated 7 years ago
- Monte Carlo Tree Search (MCTS) ,realize using python☆11Updated 8 years ago