DHDev0 / MuzeroLinks
Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
☆18Updated 2 years ago
Alternatives and similar repositories for Muzero
Users that are interested in Muzero are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆67Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆28Updated 3 weeks ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated 10 months ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Neuroevolution Benchmark in JAX 🦕☆39Updated last year
- Python implementation of the genetic algorithm MAP-Elites with applications in constrained optimization☆54Updated 4 years ago
- ☆52Updated 2 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Updated 4 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆35Updated last year
- Reinforcement learning algorithms in RLlib☆60Updated last year
- Baselines for gymnax 🤖☆67Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Code and links for over 25,000 trained Atari agents☆96Updated 10 months ago
- Model-Based RL Demo for Pendulum-v0☆13Updated 5 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆57Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆87Updated 4 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆10Updated last month
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆81Updated last year
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Updated last year
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆53Updated 2 years ago
- ☆20Updated 5 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Updated 5 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- Reinforcement learning training framework for entity-gym environments.☆17Updated last year