michaelnny / muzero
A PyTorch implementation of DeepMind's MuZero agent
☆29Updated last year
Alternatives and similar repositories for muzero:
Users that are interested in muzero are comparing it to the libraries listed below
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆39Updated 4 years ago
- fast + parallel AlphaZero in JAX☆92Updated 2 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆55Updated 3 months ago
- ☆50Updated last year
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆20Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆60Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- Accelerated minigrid environments with JAX☆132Updated 7 months ago
- Baselines for gymnax 🤖☆65Updated last year
- An API conversion tool for popular external reinforcement learning environments☆152Updated last month
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 6 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆27Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆74Updated last year
- Reinforcement learning training framework for entity-gym environments.☆17Updated 11 months ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- ☆100Updated last year
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 6 months ago
- ☆70Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- ☆28Updated 2 years ago
- Partially Observable Process Gym☆178Updated 7 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆48Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆94Updated last year
- Modular framework for Reinforcement Learning in python☆171Updated 2 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆24Updated 8 months ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago