michaelnny / muzeroLinks

A PyTorch implementation of DeepMind's MuZero agent

☆35

Alternatives and similar repositories for muzero

Users that are interested in muzero are comparing it to the libraries listed below

Sorting:

lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97Updated 6 months ago
kenjyoung / mctx_learning_demo
☆51Updated 2 years ago
Zeta36 / muzero
A simple implementation of MuZero algorithm for connect4 game
☆96Updated 4 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆113Updated 10 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73Updated 10 months ago
epignatelli / navix
Accelerated minigrid environments with JAX
☆140Updated 3 weeks ago
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆159Updated 4 years ago
DHDev0 / Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆66Updated last year
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60Updated 7 months ago
instadeepai / awesome-marl
A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers
☆52Updated 2 years ago
DramaCow / jaxued
☆82Updated 3 months ago
alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆29Updated last month
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆67Updated 2 years ago
google-deepmind / zipfian_environments
☆28Updated 2 years ago
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆80Updated last year
adityabingi / Dreamer
Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite
☆39Updated 2 years ago
damat-le / gym-simplegrid
Simple Grid Environment for Gymnasium
☆59Updated 4 months ago
rlglab / minizero
MiniZero: An AlphaZero and MuZero Training Framework
☆94Updated 4 months ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆103Updated last year
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆100Updated 8 months ago
cswinter / DeepCodeCraft
Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.
☆21Updated 2 years ago
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45Updated 4 years ago
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆36Updated 3 months ago
timoklein / alphazero-gym
AlphaZero for continuous control tasks
☆23Updated 2 years ago
google-research / reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆96Updated last year
automl / CARL
Benchmarking RL generalization in an interpretable way.
☆157Updated last week
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41Updated 2 years ago
Reytuag / transformerXL_PPO_JAX
☆79Updated 7 months ago
Farama-Foundation / Shimmy
An API conversion tool for popular external reinforcement learning environments
☆173Updated 5 months ago
andyljones / boardlaw
Scaling scaling laws with board games.
☆49Updated last year