Ergodice / lczero-training
For code etc relating to the network training process.
☆20 · Updated 2 months ago
Alternatives and similar repositories for lczero-training:
Users who are interested in lczero-training are comparing it to the libraries listed below:
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆73 · Updated 3 months ago
- fast + parallel AlphaZero in JAX ☆94 · Updated 3 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling ☆110 · Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework ☆85 · Updated last month
- AlphaZero in JAX ☆77 · Updated 11 months ago
- ☆84 · Updated 2 months ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree ☆17 · Updated last year
- For code etc relating to the network training process. ☆156 · Updated 9 months ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 · Updated 3 years ago
- An implementation of MuZero in JAX. ☆56 · Updated 2 years ago
- A tool for lc0 training data operations ☆28 · Updated 10 months ago
- Scaling scaling laws with board games. ☆48 · Updated last year
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games ☆119 · Updated 5 months ago
- A C++ PyTorch implementation of MuZero ☆37 · Updated 11 months ago
- Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon. ☆47 · Updated this week
- A project to train a neural network to play Checkers through self-play combined with Monte Carlo Tree Search. ☆54 · Updated 3 years ago
- ☆50 · Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 · Updated 7 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆81 · Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆58 · Updated 4 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆158 · Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game ☆97 · Updated 4 years ago
- ☆67 · Updated 3 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function ☆13 · Updated 2 years ago
- PyTorch implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆64 · Updated last year
- A neural net chess engine in 95 lines of Python ☆74 · Updated 4 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆46 · Updated last year
- Classic MCTS example with mctx ☆18 · Updated last year
- An implementation of PPO in PyTorch ☆69 · Updated last month
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆44 · Updated 2 years ago