michaelnny / muzero
A PyTorch implementation of DeepMind's MuZero agent
☆33Updated last year
Alternatives and similar repositories for muzero:
Users that are interested in muzero are comparing it to the libraries listed below
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆58Updated 4 months ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆87Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- ☆50Updated last year
- Accelerated minigrid environments with JAX☆132Updated 8 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆25Updated 9 months ago
- fast + parallel AlphaZero in JAX☆94Updated 3 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆71Updated 7 months ago
- Adaptable tools to make reinforcement learning and evolutionary computation algorithms.☆56Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆95Updated last year
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆46Updated last year
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆39Updated 4 years ago
- An Open-Ended Agentic Simulator☆45Updated 7 months ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆21Updated last year
- Explainable Reinforcement Learning (XRL) Resources☆38Updated 6 months ago
- ☆193Updated 3 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆86Updated 3 weeks ago
- MiniZero: An AlphaZero and MuZero Training Framework☆85Updated last month
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆229Updated last week
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆50Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆77Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆36Updated 2 years ago
- ☆75Updated 2 weeks ago
- Neuro-evolution for OpenAI Gym environments☆56Updated 4 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago