scascin0 / alphazero

A working AlphaZero implementation that's simple enough to understand at a quick glance, without sacrificing too much.

☆13

Alternatives and similar repositories for alphazero

Users who are interested in alphazero are comparing it to the libraries listed below.

tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆53 Updated last month
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆56 Updated 2 years ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆67 Updated 2 years ago
linesd / tabular-methods
Tabular methods for reinforcement learning
☆38 Updated 4 years ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆41 Updated 2 years ago
henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆79 Updated 3 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆113 Updated 10 months ago
epignatelli / navix
Accelerated minigrid environments with JAX
☆139 Updated 2 weeks ago
danijar / ninjax
General Modules for JAX
☆65 Updated 2 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41 Updated 2 years ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56 Updated 2 years ago
DramaCow / jaxued
☆82 Updated 3 months ago
chandar-lab / RLHive
☆101 Updated last year
rystrauss / dopamax
Reinforcement learning in pure JAX.
☆13 Updated 4 months ago
andyljones / boardlaw
Scaling scaling laws with board games.
☆49 Updated last year
roger-creus / Wave-Defense-Learning-Environment
A video game made with PyGame, turned into an OpenAI Gym learning environment for reinforcement learning agents.
☆14 Updated 2 years ago
toshikwa / rljax
A collection of RL algorithms written in JAX.
☆98 Updated 2 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56 Updated 2 years ago
ethanluoyc / magi
Reinforcement learning library in JAX.
☆100 Updated last year
kenjyoung / mctx_learning_demo
☆51 Updated 2 years ago
lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97 Updated 6 months ago
FLAIROx / jaxirl
Contains JAX implementations of algorithms for inverse reinforcement learning
☆73 Updated 10 months ago
MyNameIsArko / RL-Flax
Various reinforcement learning algorithms written in JAX + Flax
☆26 Updated 2 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.
☆79 Updated last year
facebookresearch / mtenv
MultiTask Environments for Reinforcement Learning.
☆76 Updated 2 years ago
chamorajg / pl-dreamer
Simplistic PyTorch implementation of Dreamer-RL
☆21 Updated last month
Max-We / alphazero-tetris
An implementation of AlphaZero and MCTS with neural networks for Tetris
☆21 Updated 3 months ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in a pure-GPU setting
☆173 Updated 3 months ago
zach-lawless / gym-wordle
Gym environment for playing Wordle with RL agents
☆39 Updated 3 years ago
tristandeleu / jax-meta-learning
A collection of meta-learning algorithms in JAX
☆23 Updated 2 years ago