Ergodice / lczero-training
For code etc relating to the network training process.
☆16Updated last week
Related projects ⓘ
Alternatives and complementary repositories for lczero-training
- fast + parallel AlphaZero in JAX☆85Updated this week
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆98Updated last year
- AlphaZero in JAX☆70Updated 7 months ago
- MiniZero: An AlphaZero and MuZero Training Framework☆72Updated last month
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆66Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆45Updated last year
- ☆48Updated last year
- ☆81Updated 2 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated last week
- Code for magnetic mirror descent.☆15Updated last year
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Scaling scaling laws with board games.☆43Updated last year
- Standard interface for entity based reinforcement learning environments.☆36Updated 8 months ago
- ♟️ Vectorized RL game environments in JAX☆414Updated last week
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆58Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆14Updated 10 months ago
- Classic MCTS example with mctx☆16Updated last year
- A structured implementation of MuZero☆206Updated 2 years ago
- SpielViz is an interactive viewer for OpenSpiel games.☆28Updated 6 months ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆118Updated 7 months ago
- Implementation of Deepmind's AlphaZero algorithm with Caffe and C++☆19Updated 6 years ago
- Gated Transformer Model for Computer Vision☆23Updated 3 years ago
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"☆10Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆39Updated last year
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆191Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆63Updated 3 months ago
- ☆64Updated 3 months ago