fbora / tic-tac-GO_ZERO
Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe
☆16Updated 7 years ago
Alternatives and similar repositories for tic-tac-GO_ZERO
Users that are interested in tic-tac-GO_ZERO are comparing it to the libraries listed below
Sorting:
- A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"☆13Updated 7 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆58Updated last year
- Code to recreate AlphaGo Zero models☆19Updated 2 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- Monte Carlo Tree Search (MCTS) ,realize using python☆11Updated 9 years ago
- ☆18Updated 6 years ago
- reinforcement learning algorithms from the book by Sutton and Barto☆16Updated 4 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆42Updated 8 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- An implementation of the AlphaZero algorithm for chess☆33Updated 2 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 9 years ago
- 9x9 AlphaGo☆13Updated 8 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Updated 2 years ago
- Demo of UCT (MCTS) in Python / Numpy☆85Updated 2 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- ☆12Updated 4 years ago
- Distributed Tensorflow Implementation of Asynchronous DDPG☆12Updated 7 years ago
- Chainer implementation of Double Deep Q-Network (Double DQN)☆27Updated 9 years ago
- Reinforcement learning algorithms to play Poker☆14Updated 3 years ago
- Investigations into simplified holdem poker☆12Updated 12 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"☆84Updated 7 years ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 5 years ago
- ☆28Updated 6 years ago
- in progress☆60Updated 9 years ago
- ☆22Updated 6 years ago
- ☆56Updated 2 years ago