rlglab / minizeroLinks

MiniZero: An AlphaZero and MuZero Training Framework

☆94

Alternatives and similar repositories for minizero

Users that are interested in minizero are comparing it to the libraries listed below

Sorting:

lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97Updated 6 months ago
DHDev0 / Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆67Updated last year
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60Updated 8 months ago
tuero / muzero-cpp
A C++ pytorch implementation of MuZero
☆39Updated last year
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114Updated 10 months ago
Reytuag / transformerXL_PPO_JAX
☆80Updated 8 months ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆107Updated last year
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56Updated 2 years ago
baskuit / R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
☆47Updated 2 years ago
Carbon225 / mctx-classic
Classic MCTS example with mctx
☆18Updated 2 years ago
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆159Updated 4 years ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41Updated 2 years ago
DramaCow / jaxued
☆82Updated 3 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆104Updated 3 weeks ago
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆58Updated last year
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆189Updated 10 months ago
NTT123 / a0-jax
AlphaZero in JAX
☆78Updated last year
luchris429 / JaxLife
An Open-Ended Agentic Simulator
☆51Updated 11 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73Updated 10 months ago
kenjyoung / mctx_learning_demo
☆52Updated 2 years ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆101Updated 8 months ago
instadeepai / flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆239Updated 3 months ago
epignatelli / navix
Accelerated minigrid environments with JAX
☆141Updated last month
MichaelTMatthews / Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆322Updated last week
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆80Updated last year
hr0nix / dejax
Accelerated replay buffers in JAX
☆41Updated 2 years ago
google-research / reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆96Updated 2 years ago
cassidylaidlaw / effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
☆48Updated last year
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆27Updated 2 weeks ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆173Updated 3 months ago