michaelnny / muzero
A PyTorch implementation of DeepMind's MuZero agent
☆30Updated last year
Alternatives and similar repositories for muzero:
Users that are interested in muzero are comparing it to the libraries listed below
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆73Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆48Updated 2 years ago
- ☆49Updated last year
- Baselines for gymnax 🤖☆61Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 5 months ago
- Explainable Reinforcement Learning (XRL) Resources☆37Updated 4 months ago
- ☆13Updated last year
- Various reinforcement learning algorithms written in Jax + Flax☆23Updated last year
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆37Updated 4 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆47Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆24Updated 7 months ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆93Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆53Updated 2 months ago
- fast + parallel AlphaZero in JAX☆90Updated last month
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch☆51Updated last week
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- On-the-fly conversions between Jax and NumPy tensors☆49Updated last year
- Accelerated minigrid environments with JAX☆128Updated 5 months ago
- Adaptable tools to make reinforcement learning and evolutionary computation algorithms.☆56Updated 2 years ago
- ☆67Updated 5 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆94Updated last year
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆20Updated last year
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆76Updated last month
- Benchmarking RL generalization in an interpretable way.☆138Updated 11 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆34Updated 2 years ago