Hwhitetooth/jax_muzero

An implementation of MuZero in JAX.

☆58

Alternatives and similar repositories for jax_muzero

Users who are interested in jax_muzero are comparing it to the repositories listed below.

sail-sg / rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30 • Jul 18, 2023 • Updated 3 years ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44 • Sep 19, 2022 • Updated 3 years ago
frt03 / inference-based-rl
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)
☆20 • Oct 25, 2021 • Updated 4 years ago
facebookresearch / entity-factored-rl
Source code for the paper "Policy Architectures for Compositional Generalization in Control"
☆30 • May 19, 2022 • Updated 4 years ago
fidel-schaposnik / muzero
Tensorflow implementation of MuZero algorithm
☆11 • Aug 23, 2022 • Updated 3 years ago
RobertTLange / gymnax
RL Environments in JAX 🌍
☆910 • Apr 2, 2026 • Updated 3 months ago
NTT123 / a0-jax
AlphaZero in JAX
☆82 • Apr 3, 2024 • Updated 2 years ago
tuero / muzero-cpp
A C++ pytorch implementation of MuZero
☆40 • May 18, 2026 • Updated 2 months ago
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆66 • Nov 14, 2024 • Updated last year
YeWR / EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆938 • Dec 20, 2023 • Updated 2 years ago
ethanluoyc / magi
Reinforcement learning library in JAX.
☆102 • Oct 22, 2023 • Updated 2 years ago
mgrankin / minGPT
minGPT in JAX
☆49 • Jan 10, 2022 • Updated 4 years ago
bstadie / krazyworld
krazy grid world
☆26 • Mar 2, 2020 • Updated 6 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆125 • Aug 22, 2024 • Updated last year
sail-sg / hloenv
an environment based on XLA for deep learning compiler optimization research.
☆24 • Mar 7, 2023 • Updated 3 years ago
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83 • May 13, 2024 • Updated 2 years ago
kevinzakka / dm_env_wrappers
Standalone library of frequently-used wrappers for dm_env environments.
☆19 • Jul 9, 2024 • Updated 2 years ago
google-deepmind / nao_top10
☆19 • Mar 1, 2023 • Updated 3 years ago
google-deepmind / csuite
☆47 • Jul 22, 2026 • Updated last week
sotetsuk / pgx
♟️ Vectorized RL game environments in JAX
☆634 • Mar 6, 2025 • Updated last year
timeolord / Reinforcement-Learning-Stock-Trader
Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…
☆20 • Jun 24, 2026 • Updated last month
danijar / ninjax
General Modules for JAX
☆74 • Apr 7, 2026 • Updated 3 months ago
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆169 • Mar 28, 2021 • Updated 5 years ago
orybkin / lexa-benchmark
☆42 • May 11, 2022 • Updated 4 years ago
ikostrikov / jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
☆757 • Oct 26, 2022 • Updated 3 years ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14 • Oct 7, 2024 • Updated last year
kelechi-c / dit_flow
DiT (training + flow matching) in Jax
☆12 • Jan 5, 2025 • Updated last year
sail-sg / optim4rl
Optim4RL is a Jax framework of learning to optimize for reinforcement learning.
☆28 • Nov 27, 2024 • Updated last year
younggyoseo / MV-MWM
☆61 • Apr 16, 2023 • Updated 3 years ago
leor-c / REM
Improving Token-Based World Models with Parallel Observation Prediction (ICML 2024)
☆14 • Feb 23, 2026 • Updated 5 months ago
ituvisionlab / EdVAE
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
☆14 • Sep 20, 2024 • Updated last year
orybkin / lexa
Discovering and Achieving Goals via World Models, NeurIPS 2021
☆90 • Jan 24, 2024 • Updated 2 years ago
google-deepmind / mctx
Monte Carlo tree search in JAX
☆2,649 • Jul 9, 2026 • Updated 2 weeks ago
Asap7772 / PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32 • Oct 26, 2022 • Updated 3 years ago
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29 • Apr 8, 2026 • Updated 3 months ago
mazpie / redundancy-action-spaces
[RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation
☆23 • Jun 6, 2024 • Updated 2 years ago
jesbu1 / hidio
Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆48 • Dec 10, 2021 • Updated 4 years ago
sail-sg / envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
☆1,489 • Jul 17, 2026 • Updated last week
rlglab / minizero
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆137 • Jul 17, 2026 • Updated last week