michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆95Updated 2 months ago
Alternatives and similar repositories for alpha_zero:
Users that are interested in alpha_zero are comparing it to the libraries listed below
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆68Updated last month
- MiniZero: An AlphaZero and MuZero Training Framework☆76Updated last month
- fast + parallel AlphaZero in JAX☆90Updated 3 weeks ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆200Updated last year
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆42Updated last year
- ☆192Updated last year
- ♟️ Vectorized RL game environments in JAX☆433Updated last month
- A project that provides help for using DeepMind's mctx on gym-style environments.☆52Updated 2 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆59Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆56Updated 5 months ago
- Pytorch Implementation of MuZero☆347Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆30Updated 2 years ago
- AlphaZero in JAX☆72Updated 9 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- ☆48Updated last year
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆376Updated this week
- ☆209Updated last month
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆107Updated 10 months ago
- A PyTorch implementation of DeepMind's MuZero agent☆29Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆114Updated 3 years ago
- An API conversion tool for popular external reinforcement learning environments☆146Updated last week
- ☆242Updated 2 years ago
- ☆16Updated 3 years ago
- Transformer-based World Models☆75Updated last year
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 8 months ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆37Updated 4 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆45Updated last year
- A Simplified Pytorch Version of the Dreamer Algorithm☆114Updated last year
- ☆13Updated 2 years ago
- ☆26Updated last year