blanyal / alpha-zeroLinks
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
☆90Updated 7 years ago
Alternatives and similar repositories for alpha-zero
Users that are interested in alpha-zero are comparing it to the libraries listed below
Sorting:
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆82Updated 2 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆172Updated 3 years ago
- A structured implementation of MuZero☆204Updated 3 years ago
- A checkers reinforcement learning AI, and all the tools needed to train it.☆56Updated 5 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆58Updated last year
- Demo of UCT (MCTS) in Python / Numpy☆86Updated 2 years ago
- Pytorch Implementation of MuZero☆352Updated last year
- Reversi reinforcement learning by AlphaGo Zero methods.☆678Updated 2 years ago
- Board game AI implementations using Monte Carlo Tree Search☆182Updated 5 years ago
- ☆67Updated 3 years ago
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆81Updated 5 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆76Updated 5 months ago
- A student implementation of Alpha Go Zero☆280Updated 6 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆215Updated 2 years ago
- Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th …☆342Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- AlphaGo Zero paper and code for studying purpose☆28Updated 7 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆210Updated 3 months ago
- A 9x9 fork of leela-zero, which is targeted to provide a learning exercise with limited resources☆25Updated 2 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆80Updated 6 years ago
- Sandbox for playing with neural nets for Go☆75Updated 6 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆158Updated 4 years ago
- datasets for computer go☆153Updated 11 months ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆368Updated last year
- Implementation of Deepmind's AlphaZero algorithm with Caffe and C++☆19Updated 7 years ago
- Open AI gym environment for the game 2048☆73Updated 3 years ago
- Monte carlo tree search in python☆603Updated 2 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.☆72Updated 4 years ago
- Monte Carlo Tree Search for tic tac toe☆36Updated 6 years ago