A project that provides help for using DeepMind's mctx on gym-style environments.
☆65Nov 14, 2024Updated last year
Alternatives and similar repositories for muax
Users that are interested in muax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 10 months ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- ☆54Apr 11, 2023Updated 2 years ago
- fast + parallel AlphaZero in JAX☆110Dec 22, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 2 months ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆125Feb 25, 2026Updated last month
- Monte Carlo tree search in JAX☆2,602Sep 2, 2025Updated 6 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- ♟️ Vectorized RL game environments in JAX☆595Mar 6, 2025Updated last year
- MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl.☆16Jun 14, 2024Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,550Updated this week
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆928Dec 20, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆47Sep 24, 2024Updated last year
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆820Mar 9, 2026Updated 2 weeks ago
- ☆19Jan 16, 2025Updated last year
- MuZero☆2,791Sep 3, 2024Updated last year
- Synchronized Curriculum Learning for RL Agents☆123Mar 7, 2026Updated 2 weeks ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Modular framework for Reinforcement Learning in python☆184Feb 1, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- General framework for Bayesian inversion of continuous hierarchical models☆10Sep 20, 2021Updated 4 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray☆23May 25, 2023Updated 2 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆571Feb 25, 2026Updated last month
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆23May 18, 2025Updated 10 months ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆400Mar 18, 2026Updated last week
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆274Sep 22, 2025Updated 6 months ago
- Anomalous versions of OpenAI Gym and PyBullet3 environments☆15Oct 24, 2021Updated 4 years ago
- RL Environments in JAX 🌍☆873May 30, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A collection of graph neural networks implementations in JAX☆35Nov 28, 2023Updated 2 years ago
- ☆19Apr 22, 2024Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 9 months ago
- Project Page: Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination (IROS 2024)☆22Apr 25, 2025Updated 11 months ago
- Pytorch Implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆236Feb 26, 2026Updated last month
- A framework for Reinforcement Learning research.☆252Updated this week