daniel-monroe / lczero-trainingLinks
For code etc relating to the network training process.
☆26Updated last year
Alternatives and similar repositories for lczero-training
Users that are interested in lczero-training are comparing it to the libraries listed below
Sorting:
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆116Updated 5 months ago
- fast + parallel AlphaZero in JAX☆108Updated last year
- ☆18Updated 11 months ago
- AlphaZero in JAX☆81Updated last year
- ☆92Updated 11 months ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆133Updated 2 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆130Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆85Updated last year
- An environment for learning formal mathematical reasoning from scratch☆72Updated last year
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆36Updated 10 months ago
- fast + parallel AlphaZero in PyTorch☆15Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated last month
- ☆53Updated 2 years ago
- ☆213Updated 4 months ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆25Updated 8 months ago
- Towards Formalizing RL Theory☆40Updated 2 months ago
- ☆108Updated 5 months ago
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆69Updated last year
- Bootstrapping ARC☆153Updated last year
- Learning Universal Predictors☆81Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Updated 2 years ago
- Learn online intrinsic rewards from LLM feedback☆45Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆197Updated 2 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- Genetic programming using LLMs☆54Updated 10 months ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- ☆25Updated 3 years ago
- A C++ pytorch implementation of MuZero☆41Updated last year
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆33Updated 4 years ago