kevaday / alphazero-generalLinks
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆84Updated last year
Alternatives and similar repositories for alphazero-general
Users that are interested in alphazero-general are comparing it to the libraries listed below
Sorting:
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆230Updated 2 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆116Updated 5 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Updated 4 years ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆163Updated last year
- fast + parallel AlphaZero in JAX☆108Updated last year
- A structured implementation of MuZero☆206Updated 3 years ago
- AlphaZero in JAX☆81Updated last year
- Pytorch Implementation of MuZero☆354Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆46Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API☆177Updated 3 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆50Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX☆563Updated 9 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆918Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 5 years ago
- ☆91Updated 11 months ago
- ☆525Updated 3 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆278Updated 4 months ago
- A project to train a neural network to play Checkers through self-play combined with Monte Carlo Tree Search.☆59Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Updated 3 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆63Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆123Updated 4 years ago
- Scalable Implementation of Neural Fictitous Self-Play