michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆128Updated 6 months ago
Alternatives and similar repositories for alpha_zero:
Users that are interested in alpha_zero are comparing it to the libraries listed below
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆76Updated 4 months ago
- MiniZero: An AlphaZero and MuZero Training Framework☆91Updated 2 months ago
- fast + parallel AlphaZero in JAX☆96Updated 4 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆47Updated 2 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆110Updated last year
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆212Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX☆473Updated 2 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆118Updated 4 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆59Updated 5 months ago
- ☆228Updated 5 months ago
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments☆283Updated 2 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆81Updated 9 months ago
- Pytorch Implementation of MuZero☆352Updated last year
- A PyTorch implementation of DeepMind's MuZero agent☆34Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆157Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆172Updated 3 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆440Updated last week
- AlphaZero in JAX☆77Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning.☆166Updated 2 weeks ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆19Updated last week
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆249Updated 10 months ago
- ☆51Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- Example code for the Gym documentation☆71Updated last year
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆206Updated 2 months ago
- ☆469Updated 2 years ago
- Multi-Agent Reinforcement Learning with JAX☆571Updated last week
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆40Updated 4 years ago