DHDev0 / MuzeroLinks
Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
☆19Updated 2 years ago
Alternatives and similar repositories for Muzero
Users that are interested in Muzero are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆69Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated 11 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Reinforcement learning algorithms in RLlib☆59Updated last year
- Reinforcement learning training framework for entity-gym environments.☆17Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆31Updated last month
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆52Updated 2 years ago
- The source code for the gym-microrts paper.☆42Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆88Updated 4 years ago
- Code and links for over 25,000 trained Atari agents☆97Updated 11 months ago
- Python implementation of the genetic algorithm MAP-Elites with applications in constrained optimization☆55Updated 4 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Updated 5 years ago
- Tabular methods for reinforcement learning☆38Updated 5 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆35Updated last year
- Neuroevolution Benchmark in JAX 🦕☆39Updated last year
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆34Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆160Updated 4 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆97Updated 2 weeks ago
- Gridworld environments for OpenAI gym.☆80Updated last year
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆133Updated 11 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆58Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- A collection of papers on divergence and quality diversity☆77Updated 3 years ago
- A collection of RL algorithms written in JAX.☆102Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago