blanyal / alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
☆88 Updated 6 years ago
Related projects
Alternatives and complementary repositories for alpha-zero
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 Updated 3 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆82 Updated last year
- PyTorch implementation of AlphaZero Connect from scratch (with results) ☆82 Updated 4 years ago
- Unofficial attempt to rebuild AlphaGo Zero ☆57 Updated 6 months ago
- An environment of the board game Go using OpenAI's Gym API ☆167 Updated 2 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆73 Updated 5 years ago
- A structured implementation of MuZero ☆206 Updated 2 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆186 Updated last year
- ☆65 Updated 3 years ago
- A simple implementation of MuZero algorithm for connect4 game ☆95 Updated 4 years ago
- Reversi reinforcement learning by AlphaGo Zero methods. ☆677 Updated last year
- Board game AI implementations using Monte Carlo Tree Search ☆183 Updated 4 years ago
- AlphaGo Zero paper and code for studying purpose ☆28 Updated 6 years ago
- A simple chess environment for openai/gym ☆151 Updated 8 months ago
- PyTorch Implementation of MuZero ☆343 Updated last year
- Sandbox for playing with neural nets for Go ☆74 Updated 5 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆187 Updated 4 years ago
- Simple implementation of regret matching algorithm for RPS Nash equilibrium computation via self-play ☆24 Updated 6 years ago
- A student implementation of AlphaGo Zero ☆279 Updated 6 years ago
- Counterfactual regret minimization algorithm for Kuhn poker ☆163 Updated 5 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆65 Updated last year
- 21.1 million Go games, 18k-9p ☆119 Updated 5 years ago
- A 9x9 fork of leela-zero, which is targeted to provide a learning exercise with limited resources ☆25 Updated last year
- Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th … ☆341 Updated 2 years ago
- Solving board games like Connect4 using Deep Reinforcement Learning ☆34 Updated last year
- This is a Python implementation of AlphaZero (for chess) using a custom GUI ☆20 Updated 6 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆156 Updated 3 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning ☆286 Updated 4 months ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS) ☆232 Updated 4 months ago