bwfbowen/muax

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bwfbowen/muax)

bwfbowen / muax

A project that provides help for using DeepMind's mctx on gym-style environments.

☆66

Alternatives and similar repositories for muax

Users that are interested in muax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lowrollr / mctx-az
View on GitHub
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆27May 2, 2025Updated last year
Carbon225 / mctx-classic
View on GitHub
Classic MCTS example with mctx
☆25May 25, 2023Updated 3 years ago
bwfbowen / SLAMuZero
View on GitHub
☆10Mar 22, 2024Updated 2 years ago
kenjyoung / mctx_learning_demo
View on GitHub
☆55Apr 11, 2023Updated 3 years ago
lowrollr / turbozero
View on GitHub
fast + parallel AlphaZero in JAX
☆112Dec 22, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rlglab / minizero
View on GitHub
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆136Jul 17, 2026Updated last week
DHDev0 / Stochastic-muzero
View on GitHub
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆79Dec 31, 2025Updated 6 months ago
Hwhitetooth / jax_muzero
View on GitHub
An implementation of MuZero in JAX.
☆58Nov 8, 2022Updated 3 years ago
google-deepmind / mctx
View on GitHub
Monte Carlo tree search in JAX
☆2,645Jul 9, 2026Updated 2 weeks ago
hr0nix / omega
View on GitHub
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44Sep 19, 2022Updated 3 years ago
RyanNavillus / PPO-v3
View on GitHub
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16May 19, 2023Updated 3 years ago
sotetsuk / pgx
View on GitHub
♟️ Vectorized RL game environments in JAX
☆634Mar 6, 2025Updated last year
JuliaPlanners / SymbolicMDPs.jl
View on GitHub
MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl.
☆16Jun 14, 2024Updated 2 years ago
YeWR / EfficientZero
View on GitHub
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆939Dec 20, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
google-deepmind / csuite
View on GitHub
☆47Updated this week
instadeepai / jumanji
View on GitHub
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
☆854Jun 18, 2026Updated last month
werner-duvaud / muzero-general
View on GitHub
MuZero
☆2,845Sep 3, 2024Updated last year
YiwenAI / OpenTensor
View on GitHub
☆19Jan 16, 2025Updated last year
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
ben-eysenbach / mnm
View on GitHub
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆21Oct 6, 2021Updated 4 years ago
luchris429 / discovered-policy-optimisation
View on GitHub
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12Jun 15, 2023Updated 3 years ago
openai / ppo-ewma
View on GitHub
Code for the paper "Batch size invariance for policy optimization"
☆59Apr 2, 2023Updated 3 years ago
coax-dev / coax
View on GitHub
Modular framework for Reinforcement Learning in python
☆185Feb 1, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mbaltieri / GeneralisedFiltering
View on GitHub
General framework for Bayesian inversion of continuous hierarchical models
☆10Sep 20, 2021Updated 4 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
ingambe / RayEnvWrapper
View on GitHub
OpenAi's gym environment wrapper to vectorize them with Ray
☆23May 25, 2023Updated 3 years ago
araffin / sbx
View on GitHub
SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms
☆603Updated this week
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
EdanToledo / Stoix
View on GitHub
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
☆416Mar 18, 2026Updated 4 months ago
thibnoel / skel_disk_graph_roadmap
View on GitHub
Python implementation of the Skeleton-Disk-Graph Roadmap planner
☆16Aug 8, 2023Updated 2 years ago
TheodoreWolf / hyperoptax
View on GitHub
Parallel hyperparameter tuning with JAX
☆39Jul 18, 2026Updated last week
instadeepai / flashbax
View on GitHub
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆279Sep 22, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
wenhaol / DCM-RSSI
View on GitHub
Project Page: Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination (IROS 2024)
☆22Apr 25, 2025Updated last year
RobertTLange / gymnax
View on GitHub
RL Environments in JAX 🌍
☆910Apr 2, 2026Updated 3 months ago
instadeepai / compass
View on GitHub
COMPASS: Combinatorial Optimization with Policy Adaptation using Latent Space Search
☆47Jun 21, 2024Updated 2 years ago
modanesh / anomalous_rl_envs
View on GitHub
Anomalous versions of OpenAI Gym and PyBullet3 environments
☆15Oct 24, 2021Updated 4 years ago
alexOarga / haiku-geometric
View on GitHub
A collection of graph neural networks implementations in JAX
☆35Nov 28, 2023Updated 2 years ago
DHDev0 / Muzero-unplugged
View on GitHub
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆36Jun 25, 2025Updated last year
Shengjiewang-Jason / EfficientZeroV2
View on GitHub
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆120Aug 9, 2024Updated last year