DHDev0 / Muzero
PyTorch implementation of MuZero for gym environments. It supports any Discrete, Box, and Box2D configuration for the action space and observation space.
☆19Updated 2 years ago
Alternatives and similar repositories for Muzero
Users who are interested in Muzero are comparing it to the libraries listed below.
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- Code and links for over 25,000 trained Atari agents☆98Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Updated last year
- Neuroevolution Benchmark in JAX 🦕☆42Updated 2 years ago
- Baselines for gymnax 🤖☆74Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Updated 5 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Updated 6 months ago
- ☆53Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Updated last week
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Scaling scaling laws with board games.☆54Updated 2 years ago
- Vectorization techniques for fast population-based training.☆57Updated 3 years ago
- Gridworld environments for OpenAI gym.☆79Updated last year
- An implementation of MuZero in JAX.☆58Updated 3 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆78Updated 5 years ago
- Reinforcement learning algorithms in RLlib☆59Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 5 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆33Updated last year
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆11Updated 7 months ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"☆14Updated 3 years ago
- Fully differentiable RL environments, written in Ivy.☆66Updated 2 years ago
- Accelerated minigrid environments with JAX☆154Updated 2 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- General Modules for JAX☆72Updated 3 months ago
- A Python Toolkit for Managing a Large Number of Experiments☆31Updated last year