JimOhman / model-based-rl

Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).

☆33

Alternatives and similar repositories for model-based-rl

Users who are interested in model-based-rl are comparing it to the libraries listed below.

daisatojp / mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆76 Updated 2 years ago
DHDev0 / Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆66 Updated last year
michaelnny / deep_rl_zoo
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…
☆113 Updated last year
openai / phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
☆262 Updated 2 years ago
yusukeurakami / dreamer-pytorch
pytorch-implementation of Dreamer (Model-based Image RL Algorithm)
☆166 Updated 5 months ago
jurgisp / pydreamer
PyTorch implementation of DreamerV2 model-based RL algorithm
☆219 Updated 2 years ago
ray-project / rl-experiments
Keeping track of RL experiments
☆161 Updated 2 years ago
kc-ml2 / SimpleDreamer
A Simplified Pytorch Version of the Dreamer Algorithm
☆128 Updated last year
proroklab / popgym
Partially Observable Process Gym
☆193 Updated 2 weeks ago
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆170 Updated 3 years ago
automl / CARL
Benchmarking RL generalization in an interpretable way.
☆157 Updated last week
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆113 Updated 10 months ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆49 Updated last month
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆177 Updated 11 months ago
vincent-thevenin / DreamerV2-Pytorch
Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS
☆50 Updated 3 years ago
jsikyoon / dreamer-torch
PyTorch version of Dreamer, which follows the original TF v2 code.
☆126 Updated 3 years ago
adityabingi / Dreamer
Reproduction of Dreamer v1 and v2 in PyTorch for the DeepMind Control Suite
☆39 Updated 2 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆132 Updated 11 months ago
evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆100 Updated 3 years ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆100 Updated 7 months ago
baskuit / R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
☆47 Updated 2 years ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆180 Updated 2 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆51 Updated 9 months ago
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60 Updated 7 months ago
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆13 Updated 4 years ago
etaoxing / multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch
☆47 Updated 2 years ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆143 Updated 2 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76 Updated 2 years ago
Farama-Foundation / D4RL-Evaluations
☆198 Updated 2 years ago
x35f / unstable_baselines
Re-implementations of SOTA RL algorithms.
☆133 Updated last year