A project that provides help for using DeepMind's mctx on gym-style environments.
☆65Nov 14, 2024Updated last year
Alternatives and similar repositories for muax
Users that are interested in muax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- ☆54Apr 11, 2023Updated 3 years ago
- An implementation of MuZero in JAX.☆58Nov 8, 2022Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Monte Carlo tree search in JAX☆2,608Sep 2, 2025Updated 7 months ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆127Feb 25, 2026Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX☆597Mar 6, 2025Updated last year
- MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl.☆16Jun 14, 2024Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,567Apr 5, 2026Updated last week
- ☆47Sep 24, 2024Updated last year
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆824Mar 9, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MuZero☆2,799Sep 3, 2024Updated last year
- Synchronized Curriculum Learning for RL Agents☆124Mar 7, 2026Updated last month
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Modular framework for Reinforcement Learning in python☆184Feb 1, 2023Updated 3 years ago
- General framework for Bayesian inversion of continuous hierarchical models☆10Sep 20, 2021Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray☆23May 25, 2023Updated 2 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆577Feb 25, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆24May 18, 2025Updated 10 months ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Literate regular expressions for bash/grep☆12Aug 6, 2020Updated 5 years ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆403Mar 18, 2026Updated 3 weeks ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆274Sep 22, 2025Updated 6 months ago
- Anomalous versions of OpenAI Gym and PyBullet3 environments☆15Oct 24, 2021Updated 4 years ago
- RL Environments in JAX 🌍☆880Apr 2, 2026Updated 2 weeks ago
- A collection of graph neural networks implementations in JAX☆35Nov 28, 2023Updated 2 years ago
- ☆19Apr 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 9 months ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆241Feb 26, 2026Updated last month
- A framework for Reinforcement Learning research.☆253Apr 3, 2026Updated last week
- Pytorch Implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- Meta in-context learning for protein fitness prediction☆16Feb 7, 2025Updated last year
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago