blanyal / alpha-zeroLinks

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

☆90

Alternatives and similar repositories for alpha-zero

Users that are interested in alpha-zero are comparing it to the libraries listed below

Sorting:

Zeta36 / connect4-alpha-zero
Connect4 reinforcement learning by AlphaGo Zero methods.
☆113Updated 4 years ago
huangeddie / GymGo
An environment of the board game Go using OpenAI's Gym API
☆175Updated 3 years ago
richemslie / galvanise_zero
Learning from zero (mostly based off of AlphaZero) in General Game Playing.
☆83Updated 2 years ago
johan-gras / MuZero
A structured implementation of MuZero
☆205Updated 3 years ago
plkmo / AlphaZero_Connect4
PyTorch implementation of AlphaZero Connect from scratch (with results)
☆84Updated 5 years ago
mokemokechicken / reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
☆681Updated 2 years ago
dylandjian / SuperGo
A student implementation of Alpha Go Zero
☆280Updated 7 years ago
int8 / monte-carlo-tree-search
Monte carlo tree search in python
☆610Updated 3 years ago
koulanurag / muzero-pytorch
Pytorch Implementation of MuZero
☆354Updated 2 years ago
YuriCat / MuZeroJupyterExample
☆67Updated 3 years ago
Farama-Foundation / MicroRTS
A simple and highly efficient RTS-game-inspired environment for reinforcement learning
☆316Updated last year
JoshVarty / AlphaZeroSimple
The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with
☆218Updated 2 years ago
Narsil / alphagozero
Unofficial attempt to rebuild AlphaGo Zero
☆58Updated last year
jbradberry / mcts
Board game AI implementations using Monte Carlo Tree Search
☆184Updated 5 years ago
yhyu13 / AlphaGOZero-python-tensorflow
Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th …
☆343Updated 2 years ago
initial-h / AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
☆209Updated 5 months ago
Zeta36 / muzero
A simple implementation of MuZero algorithm for connect4 game
☆96Updated 4 years ago
yenw / computer-go-dataset
datasets for computer go
☆153Updated last year
inoryy / reaver
Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
☆559Updated 4 years ago
rockingdingo / gym-gomoku
OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)
☆88Updated 9 months ago
kevaday / alphazero-general
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆78Updated 7 months ago
bhansconnect / alpha_zero_othello
A functional Alpha Zero that plays Othello using Keras
☆116Updated 2 years ago
Nasdin / ReinforcementLearning-AtariGame
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…
☆187Updated 10 months ago
Kojoley / atari-py
An `openai/atari-py` fork with Windows support and removed zlib/libpng dependencies. Binaries (wheels) are on "Releases" tab.
☆183Updated 3 years ago
int8 / counterfactual-regret-minimization
Counterfactual regret minimization algorithm for Kuhn poker
☆175Updated 6 years ago
openai / random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
☆909Updated 4 years ago
SamRagusa / Checkers-Reinforcement-Learning
A checkers reinforcement learning AI, and all the tools needed to train it.
☆57Updated 5 years ago
hrpan / tetris_mcts
MCTS project for Tetris
☆348Updated 9 months ago
openai / gym-soccer
☆304Updated 2 years ago
rgal / gym-2048
Open AI gym environment for the game 2048
☆73Updated 3 years ago