A project that provides help for using DeepMind's mctx on gym-style environments.
☆66 · Nov 14, 2024 · Updated last year
Alternatives and similar repositories for muax
Users that are interested in muax are comparing it to the libraries listed below.
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree ☆27 · May 2, 2025 · Updated last year
- Classic MCTS example with mctx ☆25 · May 25, 2023 · Updated 3 years ago
- ☆55 · Apr 11, 2023 · Updated 3 years ago
- An implementation of MuZero in JAX. ☆58 · Nov 8, 2022 · Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆78 · Dec 31, 2025 · Updated 5 months ago
- Monte Carlo tree search in JAX ☆2,631 · Sep 2, 2025 · Updated 9 months ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆131 · May 9, 2026 · Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆43 · Sep 19, 2022 · Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆16 · May 19, 2023 · Updated 3 years ago
- ♟️ Vectorized RL game environments in JAX ☆617 · Mar 6, 2025 · Updated last year
- MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl. ☆16 · Jun 14, 2024 · Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆934 · Dec 20, 2023 · Updated 2 years ago
- ☆47 · Sep 24, 2024 · Updated last year
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX ☆840 · Jun 3, 2026 · Updated last week
- ☆19 · Jan 16, 2025 · Updated last year
- MuZero ☆2,828 · Sep 3, 2024 · Updated last year
- Synchronized Curriculum Learning for RL Agents ☆124 · Apr 18, 2026 · Updated last month
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Apr 30, 2023 · Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 · Oct 6, 2021 · Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 3 years ago
- Modular framework for Reinforcement Learning in python ☆185 · Feb 1, 2023 · Updated 3 years ago
- General framework for Bayesian inversion of continuous hierarchical models ☆10 · Sep 20, 2021 · Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 · Aug 14, 2022 · Updated 3 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray ☆23 · May 25, 2023 · Updated 3 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms ☆598 · May 11, 2026 · Updated last month
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆410 · Mar 18, 2026 · Updated 2 months ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆278 · Sep 22, 2025 · Updated 8 months ago
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm ☆27 · May 18, 2025 · Updated last year
- Anomalous versions of OpenAI Gym and PyBullet3 environments ☆15 · Oct 24, 2021 · Updated 4 years ago
- RL Environments in JAX 🌍 ☆901 · Apr 2, 2026 · Updated 2 months ago
- A collection of graph neural networks implementations in JAX ☆35 · Nov 28, 2023 · Updated 2 years ago
- ☆19 · Apr 22, 2024 · Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆35 · Jun 25, 2025 · Updated 11 months ago
- A framework for Reinforcement Learning research. ☆260 · Updated this week
- RL Environments in JAX 🌍 ☆18 · Dec 2, 2025 · Updated 6 months ago
- Efficient baselines for autocurricula in JAX. ☆214 · Aug 24, 2024 · Updated last year
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral. ☆257 · May 21, 2026 · Updated 3 weeks ago
- Meta in-context learning for protein fitness prediction ☆18 · Feb 7, 2025 · Updated last year