Ergodice / lczero-training
For code etc relating to the network training process.
☆20Updated 3 months ago
Alternatives and similar repositories for lczero-training:
Users that are interested in lczero-training are comparing it to the libraries listed below
- MiniZero: An AlphaZero and MuZero Training Framework☆89Updated 2 months ago
- fast + parallel AlphaZero in JAX☆95Updated 4 months ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆73Updated 4 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆118Updated last year
- AlphaZero in JAX☆77Updated last year
- fast + parallel AlphaZero in PyTorch☆11Updated last year
- ☆14Updated last year
- ☆51Updated 2 years ago
- ☆85Updated 3 months ago
- Efficient baselines for autocurricula in JAX.☆187Updated 8 months ago
- ♟️ Vectorized RL game environments in JAX☆465Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆81Updated last month
- Scaling scaling laws with board games.☆48Updated last year
- Implementation of Deepmind's AlphaZero algorithm with Caffe and C++☆19Updated 7 years ago
- Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.☆52Updated 3 weeks ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆125Updated 6 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆58Updated 5 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆128Updated last year
- Accelerated minigrid environments with JAX☆134Updated 8 months ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Levin tree search guided by both a policy and a heuristic function☆18Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆34Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆157Updated 4 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆44Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆18Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available☆46Updated 3 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆100Updated last year