CogitoNTNU / AlphaZeroLinks
An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row
☆21Updated 2 years ago
Alternatives and similar repositories for AlphaZero
Users that are interested in AlphaZero are comparing it to the libraries listed below
Sorting:
- A structured implementation of MuZero☆204Updated 3 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆216Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆159Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 4 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆78Updated 7 months ago
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆83Updated 5 years ago
- An environment of the board game Go using OpenAI's Gym API☆175Updated 3 years ago
- AlphaZero in JAX☆78Updated last year
- Pytorch Implementation of MuZero☆353Updated last year
- ☆222Updated last year
- ♟️ Vectorized RL game environments in JAX☆498Updated 4 months ago
- ☆67Updated 3 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆44Updated 2 years ago
- A simple chess environment for openai/gym☆161Updated last year
- fast + parallel AlphaZero in JAX☆97Updated 6 months ago
- ☆30Updated 2 years ago
- Reference implementation of DeepMinds AlphaGo based on "Deep Learning and the Game of Go"☆46Updated 6 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆94Updated 4 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆47Updated 2 years ago
- A grid-world game engine for game AI research☆245Updated last year
- Clean, tested, & modular AlphaZero implementation with multiplayer support.☆17Updated 6 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆60Updated 8 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆903Updated last year
- ☆18Updated 3 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆477Updated 3 weeks ago
- Really Fast End-to-End Jax RL Implementations☆908Updated 10 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆67Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆326Updated last week
- RL Environments in JAX 🌍☆782Updated last month
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.☆680Updated last year