bwfbowen / muaxView external linksLinks
A project that provides help for using DeepMind's mctx on gym-style environments.
☆64Nov 14, 2024Updated last year
Alternatives and similar repositories for muax
Users that are interested in muax are comparing it to the libraries listed below
Sorting:
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 9 months ago
- ☆53Apr 11, 2023Updated 2 years ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- fast + parallel AlphaZero in JAX☆109Dec 22, 2024Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Dec 31, 2025Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆121Updated this week
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- ☆46Sep 24, 2024Updated last year
- Monte Carlo tree search in JAX☆2,589Sep 2, 2025Updated 5 months ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Oct 6, 2021Updated 4 years ago
- An implementation of MuZero in JAX.☆57Nov 8, 2022Updated 3 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,530Updated this week
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆804Dec 1, 2025Updated 2 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆924Dec 20, 2023Updated 2 years ago
- Swarm learning algorithm☆11Jun 2, 2021Updated 4 years ago
- General framework for Bayesian inversion of continuous hierarchical models☆10Sep 20, 2021Updated 4 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆270Sep 22, 2025Updated 4 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 7 months ago
- Synchronized Curriculum Learning for RL Agents☆122Feb 1, 2026Updated last week
- RL Environments in JAX 🌍☆857May 30, 2025Updated 8 months ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- Extending rllab to event-driven multiagent environments☆13Oct 1, 2018Updated 7 years ago
- MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl.☆16Jun 14, 2024Updated last year
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆15Jan 3, 2023Updated 3 years ago
- ☆12Apr 22, 2022Updated 3 years ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆390Oct 29, 2025Updated 3 months ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Efficient baselines for autocurricula in JAX.☆206Aug 24, 2024Updated last year
- Project Page: Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination (IROS 2024)☆22Apr 25, 2025Updated 9 months ago
- Meta in-context learning for protein fitness prediction☆16Feb 7, 2025Updated last year
- ☆19Jan 16, 2025Updated last year
- A collection of matrix games in JAX☆13Nov 28, 2024Updated last year
- Modular framework for Reinforcement Learning in python☆183Feb 1, 2023Updated 3 years ago
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.☆229Jan 24, 2026Updated 2 weeks ago
- COMPASS: Combinatorial Optimization with Policy Adaptation using Latent Space Search☆42Jun 21, 2024Updated last year