daniel-monroe / lczero-trainingLinks

For code etc relating to the network training process.

☆21

Alternatives and similar repositories for lczero-training

Users that are interested in lczero-training are comparing it to the libraries listed below

Sorting:

rlglab / minizero
MiniZero: An AlphaZero and MuZero Training Framework
☆94Updated 4 months ago
AlignmentResearch / go_attack
☆86Updated 6 months ago
kevaday / alphazero-general
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆78Updated 7 months ago
lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97Updated 6 months ago
NTT123 / a0-jax
AlphaZero in JAX
☆78Updated last year
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆159Updated 4 years ago
waterhorse1 / ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆124Updated last year
lowrollr / turbozero_torch
fast + parallel AlphaZero in PyTorch
☆12Updated last year
tuero / muzero-cpp
A C++ pytorch implementation of MuZero
☆39Updated last year
sethkarten / pokechamp
Official repository of the ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.
☆67Updated last week
andyljones / boardlaw
Scaling scaling laws with board games.
☆49Updated last year
johan-gras / MuZero
A structured implementation of MuZero
☆204Updated 3 years ago
kenjyoung / mctx_learning_demo
☆52Updated 2 years ago
MetcalfeTom / stable-baselines3-GPU
A GPU-accelerated fork of stable-baselines. Delivering reliable implementations of reinforcement learning algorithms.
☆23Updated 4 years ago
timoklein / alphazero-gym
AlphaZero for continuous control tasks
☆23Updated 2 years ago
baskuit / R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
☆47Updated 2 years ago
lowrollr / mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆20Updated 2 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41Updated 2 years ago
LeelaChessZero / lczero-training
For code etc relating to the network training process.
☆162Updated last year
facebookresearch / macta
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
☆46Updated 2 years ago
petosa / multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
☆34Updated 4 years ago
enpasos / muzero
☆13Updated 6 months ago
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60Updated 8 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73Updated 10 months ago
levilelis / h-levin
Levin tree search guided by both a policy and a heuristic function
☆19Updated 2 years ago
sotetsuk / pgx
♟️ Vectorized RL game environments in JAX
☆494Updated 4 months ago
gpoesia / peano
An environment for learning formal mathematical reasoning from scratch
☆71Updated 10 months ago
Carbon225 / mctx-classic
Classic MCTS example with mctx
☆18Updated 2 years ago
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆189Updated 10 months ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114Updated 10 months ago