michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆106Updated 3 months ago
Alternatives and similar repositories for alpha_zero:
Users that are interested in alpha_zero are comparing it to the libraries listed below
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆71Updated 2 months ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆206Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework☆78Updated 2 months ago
- fast + parallel AlphaZero in JAX☆92Updated last month
- ☆214Updated 3 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆45Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆60Updated last year
- ♟️ Vectorized RL game environments in JAX☆440Updated this week
- An environment of the board game Go using OpenAI's Gym API☆170Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆115Updated 3 years ago
- Pytorch Implementation of MuZero☆348Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆53Updated 3 months ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆196Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- This project is implementation code of AlphaStar☆195Updated last year
- ☆202Updated last year
- A structured implementation of MuZero☆207Updated 2 years ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆107Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆30Updated 2 years ago
- Example code for the Gym documentation☆71Updated last year
- AlphaZero in JAX☆74Updated 10 months ago
- ☆26Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆62Updated 6 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- [NeurIPS 2022] 1st Place Solution for the 3rd Neural MMO Challenge☆29Updated 2 years ago
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆343Updated 3 weeks ago
- ☆49Updated last year
- A Simplified Pytorch Version of the Dreamer Algorithm☆117Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆45Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆44Updated 2 years ago