Ergodice / lczero-training

For code etc relating to the network training process.

☆20

Alternatives and similar repositories for lczero-training:

Users that are interested in lczero-training are comparing it to the libraries listed below

rlglab / minizero
MiniZero: An AlphaZero and MuZero Training Framework
☆89Updated 2 months ago
lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆95Updated 4 months ago
kevaday / alphazero-general
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆73Updated 4 months ago
waterhorse1 / ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆118Updated last year
NTT123 / a0-jax
AlphaZero in JAX
☆77Updated last year
lowrollr / turbozero_torch
fast + parallel AlphaZero in PyTorch
☆11Updated last year
Holmeswww / SPRING
☆14Updated last year
kenjyoung / mctx_learning_demo
☆51Updated 2 years ago
AlignmentResearch / go_attack
☆85Updated 3 months ago
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆187Updated 8 months ago
sotetsuk / pgx
♟️ Vectorized RL game environments in JAX
☆465Updated last month
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆81Updated last month
andyljones / boardlaw
Scaling scaling laws with board games.
☆48Updated last year
adepierre / Caffe_AlphaZero
Implementation of Deepmind's AlphaZero algorithm with Caffe and C++
☆19Updated 7 years ago
sethkarten / pokechamp
Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.
☆52Updated 3 weeks ago
michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆125Updated 6 months ago
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆58Updated 5 months ago
facebookresearch / motif
Intrinsic Motivation from Artificial Intelligence Feedback
☆128Updated last year
epignatelli / navix
Accelerated minigrid environments with JAX
☆134Updated 8 months ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56Updated 2 years ago
levilelis / h-levin
Levin tree search guided by both a policy and a heuristic function
☆18Updated last year
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆72Updated 8 months ago
petosa / multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
☆34Updated 4 years ago
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆157Updated 4 years ago
bhansconnect / fast-alphazero-general
A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆44Updated 2 years ago
lowrollr / mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆18Updated last year
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆111Updated 8 months ago
instadeepai / outer-value-function-meta-rl
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13Updated 2 years ago
cestpasphoto / alpha-zero-general
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
☆46Updated 3 months ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆100Updated last year