blanyal / alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
☆88 · Updated 6 years ago
Alternatives and similar repositories for alpha-zero:
Users who are interested in alpha-zero are comparing it to the repositories listed below.
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 · Updated 3 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆81 · Updated 2 years ago
- A structured implementation of MuZero ☆207 · Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API ☆168 · Updated 2 years ago
- ☆66 · Updated 3 years ago
- PyTorch implementation of AlphaZero Connect from scratch (with results) ☆80 · Updated 5 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆203 · Updated last year
- Unofficial attempt to rebuild AlphaGo Zero ☆56 · Updated 8 months ago
- Scalable implementation of Neural Fictitious Self-Play ☆74 · Updated 5 years ago
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku ☆193 · Updated 5 years ago
- Reversi reinforcement learning by AlphaGo Zero methods. ☆677 · Updated 2 years ago
- A simple implementation of the MuZero algorithm for the Connect4 game ☆97 · Updated 4 years ago
- Congratulations to DeepMind! This is a reengineered implementation (on behalf of many other git repos in /support/) of DeepMind's Oct 19th … ☆340 · Updated 2 years ago
- PyTorch implementation of MuZero ☆347 · Updated last year
- A student implementation of AlphaGo Zero ☆280 · Updated 6 years ago
- AlphaGo Zero paper and code for study purposes ☆28 · Updated 6 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆69 · Updated last month
- Board game AI implementations using Monte Carlo Tree Search ☆183 · Updated 4 years ago
- Chess reinforcement learning by AlphaZero methods. ☆17 · Updated 7 years ago
- Counterfactual regret minimization algorithm for Kuhn poker ☆165 · Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆155 · Updated 3 years ago
- OpenAI Gym environment for the game 2048 ☆71 · Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆113 · Updated 6 months ago
- Sandbox for playing with neural nets for Go ☆76 · Updated 5 years ago
- An implementation of the AlphaZero algorithm for chess ☆33 · Updated 2 years ago
- Half Field Offense in RoboCup 2D Soccer ☆230 · Updated 2 years ago
- A checkers reinforcement learning AI, and all the tools needed to train it. ☆55 · Updated 4 years ago
- Keras implementation of popular Deep RL algorithms (A3C, DDQN, DDPG, Dueling DDQN) ☆535 · Updated 4 years ago
- A simple chess environment for openai/gym ☆154 · Updated 11 months ago
- Implementing reinforcement-learning algorithms for the pysc2 environment ☆89 · Updated 7 years ago