fast + parallel AlphaZero in JAX
☆109Dec 22, 2024Updated last year
Alternatives and similar repositories for turbozero
Users that are interested in turbozero are comparing it to the libraries listed below
Sorting:
- fast + parallel AlphaZero in PyTorch☆15Jan 21, 2024Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 10 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆64Nov 14, 2024Updated last year
- ♟️ Vectorized RL game environments in JAX☆588Mar 6, 2025Updated 11 months ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆261Oct 31, 2025Updated 4 months ago
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆810Dec 1, 2025Updated 3 months ago
- Monte Carlo tree search in JAX☆2,596Sep 2, 2025Updated 6 months ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆122Feb 25, 2026Updated last week
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- Corax: Core RL in JAX☆39Feb 22, 2024Updated 2 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 2 months ago
- ☆54Apr 11, 2023Updated 2 years ago
- ☆25Apr 16, 2024Updated last year
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"☆15Jun 16, 2024Updated last year
- AlphaZero in JAX☆81Apr 3, 2024Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆37Aug 8, 2024Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,538Feb 25, 2026Updated last week
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆273Sep 22, 2025Updated 5 months ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆67Oct 3, 2023Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Named Tensors for Legible Deep Learning in JAX☆217Nov 8, 2025Updated 3 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆237Nov 24, 2025Updated 3 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆925Dec 20, 2023Updated 2 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆22Jan 22, 2024Updated 2 years ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- RL Environments in JAX 🌍☆864May 30, 2025Updated 9 months ago
- A dataloader, but for JAX☆20May 17, 2024Updated last year
- A collection of Matplotlib plot templates.☆25Oct 15, 2023Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆374Feb 10, 2026Updated 3 weeks ago
- Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.☆39Jan 16, 2026Updated last month
- Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022☆21Jul 10, 2023Updated 2 years ago
- ☆23Aug 19, 2022Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 6 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- A Simplified Pytorch Version of the Dreamer Algorithm☆149Jul 24, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago