Hwhitetooth / jax_muzeroView external linksLinks
An implementation of MuZero in JAX.
☆57Nov 8, 2022Updated 3 years ago
Alternatives and similar repositories for jax_muzero
Users that are interested in jax_muzero are comparing it to the libraries listed below
Sorting:
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- AlphaZero in JAX☆81Apr 3, 2024Updated last year
- A C++ pytorch implementation of MuZero☆40May 1, 2024Updated last year
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆923Dec 20, 2023Updated 2 years ago
- RL Environments in JAX 🌍☆857May 30, 2025Updated 8 months ago
- Reinforcement learning library in JAX.☆100Oct 22, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆18Jun 30, 2021Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 4 years ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82May 13, 2024Updated last year
- minGPT in JAX☆48Jan 10, 2022Updated 4 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆121Updated this week
- ☆19Mar 1, 2023Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Aug 22, 2024Updated last year
- ☆42May 11, 2022Updated 3 years ago
- an environment based on XLA for deep learning compiler optimization research.☆24Mar 7, 2023Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆64Nov 14, 2024Updated last year
- General Modules for JAX☆72Sep 12, 2025Updated 5 months ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆750Oct 26, 2022Updated 3 years ago
- ♟️ Vectorized RL game environments in JAX☆585Mar 6, 2025Updated 11 months ago
- Pytorch Implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- krazy grid world☆25Mar 2, 2020Updated 5 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆18Jul 9, 2024Updated last year
- Conservative Q Learning on top of SAC☆136Oct 15, 2022Updated 3 years ago
- ☆46Sep 24, 2024Updated last year
- Reinforcement learning with Rust☆14Jul 31, 2022Updated 3 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,269Aug 12, 2024Updated last year
- Example implementation of Zeebe workflows using pyzeebe.☆12Jun 1, 2021Updated 4 years ago
- Perf monitoring CLI tool for Apple Silicon☆11Jan 25, 2023Updated 3 years ago
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- Monte Carlo tree search in JAX☆2,589Sep 2, 2025Updated 5 months ago
- ☆52Jan 20, 2023Updated 3 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,530Feb 8, 2026Updated last week
- ☆31Aug 25, 2022Updated 3 years ago