An implementation of MuZero in JAX.
☆57Nov 8, 2022Updated 3 years ago
Alternatives and similar repositories for jax_muzero
Users that are interested in jax_muzero are comparing it to the libraries listed below
- Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023☆30Jul 18, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- AlphaZero in JAX☆81Apr 3, 2024Updated last year
- A C++ PyTorch implementation of MuZero☆40May 1, 2024Updated last year
- TensorFlow implementation of the MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆925Dec 20, 2023Updated 2 years ago
- RL Environments in JAX 🌍☆868May 30, 2025Updated 9 months ago
- Reinforcement learning library in JAX.☆101Oct 22, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆18Jun 30, 2021Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 4 years ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82May 13, 2024Updated last year
- minGPT in JAX☆48Jan 10, 2022Updated 4 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆122Feb 25, 2026Updated last week
- ☆19Mar 1, 2023Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Aug 22, 2024Updated last year
- ☆42May 11, 2022Updated 3 years ago
- An environment based on XLA for deep learning compiler optimization research.☆24Mar 7, 2023Updated 3 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆65Nov 14, 2024Updated last year
- General Modules for JAX☆72Feb 21, 2026Updated 2 weeks ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆753Oct 26, 2022Updated 3 years ago
- ♟️ Vectorized RL game environments in JAX☆591Mar 6, 2025Updated last year
- PyTorch implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆18Jul 9, 2024Updated last year
- krazy grid world☆25Mar 2, 2020Updated 6 years ago
- Conservative Q Learning on top of SAC☆138Oct 15, 2022Updated 3 years ago
- Reinforcement learning with Rust☆14Jul 31, 2022Updated 3 years ago
- ☆46Sep 24, 2024Updated last year
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,274Aug 12, 2024Updated last year
- Example implementation of Zeebe workflows using pyzeebe.☆12Jun 1, 2021Updated 4 years ago
- Perf monitoring CLI tool for Apple Silicon☆10Jan 25, 2023Updated 3 years ago
- DiT (training + flow matching) in JAX☆11Jan 5, 2025Updated last year
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago
- Monte Carlo tree search in JAX☆2,596Sep 2, 2025Updated 6 months ago
- ☆52Jan 20, 2023Updated 3 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,542Updated this week
- ☆31Aug 25, 2022Updated 3 years ago