michaelnny/alpha_zero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/michaelnny/alpha_zero)

michaelnny / alpha_zero

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

☆191

Alternatives and similar repositories for alpha_zero

Users that are interested in alpha_zero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

suragnair / alpha-zero-general
View on GitHub
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
☆4,484Jan 1, 2025Updated last year
kevaday / alphazero-general
View on GitHub
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆90Dec 11, 2024Updated last year
YiwenAI / OpenTensor
View on GitHub
☆19Jan 16, 2025Updated last year
michaelnny / muzero
View on GitHub
A PyTorch implementation of DeepMind's MuZero agent
☆37Dec 1, 2023Updated 2 years ago
DenseLance / mcts-simple
View on GitHub
mcts-simple is a Python3 library that implements Monte Carlo Tree Search and its variants to solve a host of problems, most commonly for …
☆33Aug 8, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
flixpar / AlphaTSP
View on GitHub
AlphaGo inspired TSP Heuristic Solver
☆14Feb 5, 2020Updated 6 years ago
tmoer / alphazero_singleplayer
View on GitHub
Single player Alpha Zero implementation
☆42Mar 7, 2022Updated 4 years ago
deep-reinforcement-learning-book / Chapter15-AlphaZero
View on GitHub
Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.
☆36Feb 18, 2020Updated 6 years ago
initial-h / AlphaZero_Gomoku_MPI
View on GitHub
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
☆221Feb 28, 2025Updated last year
bhansconnect / fast-alphazero-general
View on GitHub
A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆47Dec 27, 2022Updated 3 years ago
opendilab / LightZero
View on GitHub
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…
☆1,626Jul 17, 2026Updated last week
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
bwfbowen / muax
View on GitHub
A project that provides help for using DeepMind's mctx on gym-style environments.
☆66Nov 14, 2024Updated last year
roahmlab / RADIUS
View on GitHub
A Real-Time Reachability-based Motion Planning Algorithm for Risk-Aware Motion Planning in Uncertain Environments.
☆35Dec 13, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Optimal-Control-16-745 / JuliaIntro
View on GitHub
Some of the basics for getting started with Julia
☆22Feb 3, 2021Updated 5 years ago
s-diaco / DRL4Trading
View on GitHub
Trade using DRL algorithms on tensorflow2 and tf-agents
☆11Oct 10, 2025Updated 9 months ago
AaronYALai / Reinforcement_Learning_Project
View on GitHub
(Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.
☆19Oct 8, 2016Updated 9 years ago
AranKomat / Alpha-Transformer
View on GitHub
Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search
☆28Nov 15, 2018Updated 7 years ago
gpoesia / peano
View on GitHub
An environment for learning formal mathematical reasoning from scratch
☆72Aug 18, 2024Updated last year
ars22 / e3
View on GitHub
☆20Sep 16, 2025Updated 10 months ago
lowrollr / turbozero
View on GitHub
fast + parallel AlphaZero in JAX
☆112Dec 22, 2024Updated last year
foersterrobert / AlphaZero
View on GitHub
☆36Feb 25, 2026Updated 5 months ago
petrikvladimir / RoboMeshCat
View on GitHub
Set of utilities for visualizing robots in web-based visualizer MeshCat.
☆35Jan 29, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
aedera / anc2vec
View on GitHub
Unsupervised neural network for learning embeddings of GO terms.
☆21Feb 19, 2022Updated 4 years ago
google-deepmind / mctx
View on GitHub
Monte Carlo tree search in JAX
☆2,649Jul 9, 2026Updated 3 weeks ago
miaoruonan / MACA_test
View on GitHub
☆11Mar 18, 2021Updated 5 years ago
sanderland / pysgf
View on GitHub
Small and simple SGF parser for python
☆11Mar 9, 2026Updated 4 months ago
StanfordVL / alignment
View on GitHub
ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward
☆20Dec 5, 2022Updated 3 years ago
stephane-caron / palimpsest
View on GitHub
Fast serializable C++ dictionaries
☆15Dec 30, 2025Updated 6 months ago
p-holl / PDE-Control
View on GitHub
Code for the ICLR 2020 paper "Learning to Control PDEs"
☆37Apr 22, 2020Updated 6 years ago
PacktPublishing / Quantum-Computing-Experimentation-with-Amazon-Braket
View on GitHub
Quantum Computing Experimentation with Amazon Braket, published by Packt
☆21Feb 22, 2025Updated last year
AccomplishedCode / Deep-Reinforcement-Learning-Stock-Trader
View on GitHub
☆13Apr 28, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ejmejm / discrete-representations-for-continual-rl
View on GitHub
Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"
☆16Jun 16, 2024Updated 2 years ago
GAIGResearch / Stratega
View on GitHub
Documentation: https://stratega.readthedocs.io/en/latest/
☆55Jun 10, 2025Updated last year
kenjyoung / mctx_learning_demo
View on GitHub
☆55Apr 11, 2023Updated 3 years ago
stephane-caron / qpmpc
View on GitHub
Model predictive control in Python based on quadratic programming
☆52Jul 21, 2026Updated last week
JuliaPlanners / SymbolicMDPs.jl
View on GitHub
MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl.
☆16Jun 14, 2024Updated 2 years ago
mcleish7 / arithmetic
View on GitHub
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆200May 28, 2024Updated 2 years ago
openfeedback / superhf
View on GitHub
Open-source Human Feedback Library
☆11Oct 25, 2023Updated 2 years ago