petosa / simple-alpha-zero
Clean, tested, & modular AlphaZero implementation with multiplayer support.
☆18 · Updated 6 years ago
Alternatives and similar repositories for simple-alpha-zero
Users who are interested in simple-alpha-zero are comparing it to the libraries listed below.
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆223 · Updated 2 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆81 · Updated 10 months ago
- An environment of the board game Go using OpenAI's Gym API ☆175 · Updated 3 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆106 · Updated 3 months ago
- PyTorch implementation of Stochastic MuZero for gym environments. This algorithm is capable of supporting a wide range of action and obser… ☆73 · Updated 2 years ago
- ☆31 · Updated 2 years ago
- Fast + parallel AlphaZero in JAX ☆103 · Updated 10 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆162 · Updated 4 years ago
- ☆228 · Updated 2 years ago
- A clean implementation based on Expert Iteration for any game, inspired by alpha-zero-general ☆45 · Updated 2 years ago
- AlphaZero in JAX ☆78 · Updated last year
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 · Updated 4 years ago
- A structured implementation of MuZero ☆205 · Updated 3 years ago
- Multi-agent reinforcement learning environment ☆38 · Updated 6 years ago
- Scalable implementation of Neural Fictitious Self-Play ☆83 · Updated 6 years ago
- An OpenAI Gym environment for the Flappy Bird game ☆127 · Updated 3 years ago
- A simple implementation of the MuZero algorithm for the Connect4 game ☆96 · Updated 5 years ago
- ☆52 · Updated 2 years ago
- 📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️ ☆57 · Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆63 · Updated 11 months ago
- TensorFlow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action spaces ☆46 · Updated 8 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆84 · Updated 2 years ago
- SpielViz is an interactive viewer for OpenSpiel games. ☆36 · Updated last year
- A simple chess environment for openai/gym ☆162 · Updated last year
- Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space… ☆72 · Updated last year
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games ☆158 · Updated last year
- ☆66 · Updated 3 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS) ☆268 · Updated 2 months ago
- Experimentation with Regularized Nash Dynamics on a GPU-accelerated game ☆47 · Updated 2 years ago
- A checkers reinforcement learning AI, and all the tools needed to train it. ☆58 · Updated 5 years ago