petosa / simple-alpha-zeroLinks
Clean, tested, & modular AlphaZero implementation with multiplayer support.
☆17Updated 6 years ago
Alternatives and similar repositories for simple-alpha-zero
Users that are interested in simple-alpha-zero are comparing it to the libraries listed below
Sorting:
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆76Updated 5 months ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆135Updated 7 months ago
- An environment of the board game Go using OpenAI's Gym API☆172Updated 3 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆158Updated 4 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆214Updated 2 years ago
- A structured implementation of MuZero☆204Updated 3 years ago
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- A project to train a neural network to play Checkers through self-play combined with Monte Carlo Tree Search.☆56Updated 3 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆93Updated 3 months ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆44Updated 2 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- fast + parallel AlphaZero in JAX☆96Updated 5 months ago
- Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space…☆69Updated last year
- A C++ pytorch implementation of MuZero☆38Updated last year
- ☆28Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆65Updated last year
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆125Updated last year
- AlphaZero in JAX☆77Updated last year
- Demo of UCT (MCTS) in Python / Numpy☆86Updated 2 years ago
- ☆51Updated 2 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…☆90Updated 7 years ago
- An OpenAI Gym interface to Tetris on the NES.☆53Updated last year
- ☆68Updated 3 years ago
- ☆19Updated last year
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆34Updated 4 years ago
- Pytorch Implementation of MuZero☆352Updated last year
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆47Updated 2 years ago
- ☆48Updated 5 years ago
- Pure Python Library for ES-HyperNEAT. Contains implementations of HyperNEAT and ES-HyperNEAT.☆117Updated last year
- An OpenAI gym environment made for RL☆68Updated last year