CogitoNTNU / AlphaZero
An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row
☆20 · Updated 2 years ago
Alternatives and similar repositories for AlphaZero:
Users interested in AlphaZero are comparing it to the repositories listed below:
- A simple implementation of MuZero algorithm for connect4 game ☆97 · Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆156 · Updated 3 years ago
- AlphaZero in JAX ☆75 · Updated 11 months ago
- fast + parallel AlphaZero in JAX ☆92 · Updated 2 months ago
- ☆66 · Updated 3 years ago
- A structured implementation of MuZero ☆207 · Updated 2 years ago
- Modular framework for Reinforcement Learning in python ☆171 · Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆55 · Updated 3 months ago
- Scalable Implementation of Neural Fictitious Self-Play ☆75 · Updated 6 years ago
- PyTorch Implementation of MuZero ☆348 · Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆138 · Updated 3 months ago
- ☆50 · Updated last year
- ♟️ Vectorized RL game environments in JAX ☆444 · Updated 2 weeks ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆71 · Updated 2 months ago
- PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆60 · Updated last year
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆206 · Updated last year
- A grid-world game engine for game AI research ☆239 · Updated 10 months ago
- An environment of the board game Go using OpenAI's Gym API ☆173 · Updated 2 years ago
- This project is implementation code of AlphaStar ☆196 · Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆287 · Updated last week
- An API conversion tool for popular external reinforcement learning environments ☆152 · Updated last month
- ☆215 · Updated 3 months ago
- 🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆285 · Updated this week
- Partially Observable Process Gym ☆178 · Updated 7 months ago
- A PyTorch implementation of DeepMind's MuZero agent ☆29 · Updated last year
- The NetHack Learning Environment ☆62 · Updated this week
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆195 · Updated 2 years ago
- Develop your agent for generals.io! ☆40 · Updated this week
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments ☆265 · Updated last week
- A Python implementation of tabular MuZero for educational purposes ☆21 · Updated 5 years ago