JoshVarty / AlphaZeroSimpleLinks

The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with

☆218

Alternatives and similar repositories for AlphaZeroSimple

Users that are interested in AlphaZeroSimple are comparing it to the libraries listed below

Sorting:

kevaday / alphazero-general
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆78Updated 7 months ago
huangeddie / GymGo
An environment of the board game Go using OpenAI's Gym API
☆175Updated 3 years ago
koulanurag / muzero-pytorch
Pytorch Implementation of MuZero
☆354Updated 2 years ago
foersterrobert / AlphaZeroFromScratch
☆223Updated last year
michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆146Updated 9 months ago
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆160Updated 4 years ago
johan-gras / MuZero
A structured implementation of MuZero
☆205Updated 3 years ago
YeWR / EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆904Updated last year
geochri / AlphaZero_Chess
PyTorch implementation of AlphaZero Chess from scratch
☆166Updated 11 months ago
foersterrobert / AlphaZero
☆32Updated 2 years ago
Zeta36 / muzero
A simple implementation of MuZero algorithm for connect4 game
☆96Updated 4 years ago
sotetsuk / pgx
♟️ Vectorized RL game environments in JAX
☆510Updated 5 months ago
genyrosk / gym-chess
A simple chess environment for openai/gym
☆161Updated last year
plkmo / AlphaZero_Connect4
PyTorch implementation of AlphaZero Connect from scratch (with results)
☆84Updated 5 years ago
google-deepmind / dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…
☆476Updated last year
bhansconnect / fast-alphazero-general
A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆44Updated 2 years ago
Bam4d / Griddly
A grid-world game engine for game AI research
☆246Updated last year
DHDev0 / Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆69Updated last year
vgarciasc / mcts-viz
Visualization of MCTS algorithm applied to Tic-tac-toe.
☆250Updated 3 years ago
Farama-Foundation / MicroRTS-Py
A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
☆261Updated last year
yfeng997 / MadMario
Interactive tutorial to build a learning Mario, for first-time RL learners
☆238Updated 2 years ago
AgileRL / AgileRL
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h…
☆802Updated last week
philtabor / Deep-Q-Learning-Paper-To-Code
☆412Updated 2 years ago
michaelnny / deep_rl_zoo
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…
☆115Updated last year
int8 / monte-carlo-tree-search
Monte carlo tree search in python
☆610Updated 3 years ago
NTT123 / a0-jax
AlphaZero in JAX
☆78Updated last year
henrycharlesworth / settlers_of_catan_RL
Learning to play Settlers of Catan with Deep RL - custom training environment and implementation of PPO
☆86Updated 3 years ago
JimOhman / model-based-rl
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Updated 2 years ago
instadeepai / jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
☆745Updated last month
Talendar / flappy-bird-gym
An OpenAI Gym environment for the Flappy Bird game
☆126Updated 3 years ago