michaelnny / muzero
A PyTorch implementation of DeepMind's MuZero agent
☆27 Updated 11 months ago
Related projects
Alternatives and complementary repositories for muzero
- ☆48 Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆50 Updated 6 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers ☆48 Updated last year
- Fast, parallel AlphaZero in JAX ☆84 Updated 7 months ago
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Optimization, in PyTorch ☆44 Updated last week
- PyTorch implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆80 Updated last year
- A wrapper that vectorizes OpenAI Gym environments with Ray ☆22 Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework ☆72 Updated 3 weeks ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making ☆68 Updated 2 weeks ago
- PyTorch implementation of Stochastic MuZero for Gym environments. This algorithm is capable of supporting a wide range of action and obser… ☆56 Updated last year
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆42 Updated 4 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch. ☆26 Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆70 Updated 11 months ago
- ☆28 Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 Updated 3 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ☆91 Updated last year
- Adaptable tools to make reinforcement learning and evolutionary computation algorithms. ☆53 Updated 2 years ago
- Partially Observable Process Gym ☆166 Updated 4 months ago
- ☆13 Updated last year
- Code for Shapley values for explaining reinforcement learning. XRL feature-influence method. ☆15 Updated 10 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in NetHack/MiniHack environm… ☆39 Updated 2 years ago
- A web-based platform for collecting human actions in reinforcement learning environments ☆26 Updated last year
- A simple implementation of the MuZero algorithm for the Connect 4 game ☆95 Updated 4 years ago
- Implementation for the paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning". ☆59 Updated last month
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023 ☆51 Updated this week
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments. ☆35 Updated 4 years ago
- Emergence of complex strategies through multiagent competition ☆42 Updated last year
- Tabular methods for reinforcement learning ☆34 Updated 4 years ago