michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆81Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for alpha_zero
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆66Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆191Updated last year
- Pytorch Implementation of MuZero☆343Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆112Updated 3 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆39Updated last year
- A structured implementation of MuZero☆206Updated 2 years ago
- This project is implementation code of AlphaStar☆187Updated 10 months ago
- ♟️ Vectorized RL game environments in JAX☆414Updated last week
- An environment of the board game Go using OpenAI's Gym API☆168Updated 2 years ago
- ☆99Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆248Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆72Updated last month
- ☆235Updated 2 years ago
- ☆201Updated this week
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆106Updated 9 months ago
- [NeurIPS 2022] 1st Place Solution for the 3rd Neural MMO Challenge☆28Updated last year
- ☆418Updated 2 years ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆144Updated 2 weeks ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated last week
- Implementation of Trajectory Transformer with attention caching and batched beam search☆107Updated last year
- ☆48Updated last year
- SBX: Stable Baselines Jax (SB3 + Jax)☆345Updated this week
- fast + parallel AlphaZero in JAX☆85Updated 7 months ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆234Updated 4 months ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆35Updated 4 years ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆44Updated 3 months ago
- Partially Observable Process Gym☆167Updated 4 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆208Updated last month
- Multi-Agent Reinforcement Learning with JAX☆441Updated 2 weeks ago