DHDev0/Muzero-unplugged

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DHDev0/Muzero-unplugged)

DHDev0 / Muzero-unplugged

Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

☆36

Alternatives and similar repositories for Muzero-unplugged

Users that are interested in Muzero-unplugged are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DHDev0 / Stochastic-muzero
View on GitHub
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆79Dec 31, 2025Updated 6 months ago
hr0nix / omega
View on GitHub
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44Sep 19, 2022Updated 3 years ago
jianzhnie / RLZero
View on GitHub
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆17Oct 15, 2024Updated last year
uoe-agents / CMID
View on GitHub
☆13Apr 25, 2024Updated 2 years ago
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
RyanNavillus / PPO-v3
View on GitHub
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16May 19, 2023Updated 3 years ago
dnoursi / gym-graph-search
View on GitHub
OpenAI Gym environment for graph search problems such as shortest path.
☆11Dec 24, 2019Updated 6 years ago
google-deepmind / constrained_optidice
View on GitHub
☆10Sep 9, 2022Updated 3 years ago
MLD3 / OfflineRL_FactoredActions
View on GitHub
[NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738
☆11Nov 27, 2022Updated 3 years ago
rlglab / minizero
View on GitHub
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆136Updated this week
hari-sikchi / AWAC
View on GitHub
Advantage weighted Actor Critic for Offline RL
☆53Aug 27, 2022Updated 3 years ago
tuero / muzero-cpp
View on GitHub
A C++ pytorch implementation of MuZero
☆40May 18, 2026Updated 2 months ago
amir-abdi / docgpt
View on GitHub
Automatically generate documentation for Python scripts.
☆16Dec 21, 2022Updated 3 years ago
info-structures / ais
View on GitHub
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆23Nov 29, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ucl-dark / skillhack
View on GitHub
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning
☆17Oct 23, 2022Updated 3 years ago
aletcher / stable-opponent-shaping
View on GitHub
Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).
☆21Jan 15, 2020Updated 6 years ago
sekstini / gpupoor
View on GitHub
☆18Dec 2, 2024Updated last year
BY571 / Implicit-Q-Learning
View on GitHub
PyTorch implementation of the implicit Q-learning algorithm (IQL)
☆44Dec 17, 2021Updated 4 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
gauthamvasan / avg
View on GitHub
Action Value Gradient Algorithm
☆28May 18, 2025Updated last year
chenghands-on / Dreamer_assemble
View on GitHub
An assemble of various world model including dreamer v2 and v3
☆10Sep 9, 2023Updated 2 years ago
YangRui2015 / Model-basedHER
View on GitHub
Model-based Hindsight Experience Replay
☆10Jun 8, 2022Updated 4 years ago
jurgisp / memory-maze
View on GitHub
Evaluating long-term memory of reinforcement learning algorithms
☆180Jun 23, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
whatbirdisthat / cyberdolphin
View on GitHub
Cyberdolphin Suite of ComfyUI nodes for wiring up OpenAI and compatible LLM APIs.
☆15Jul 31, 2024Updated last year
AI4Finance-Foundation / Risk-Management-using-Deep-Learning-for-Midterm-Stock-Prediction-KDD-2019
View on GitHub
Risk Management via Anomaly Circumvent: Mnemonic Deep Learning for Midterm Stock Prediction. KDD 2019.
☆23Aug 26, 2020Updated 5 years ago
scottemmons / rvs
View on GitHub
Reinforcement Learning via Supervised Learning
☆72May 16, 2022Updated 4 years ago
AlexGoldie / rl-learned-optimization
View on GitHub
Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"
☆31Dec 15, 2025Updated 7 months ago
hlorenzi / mapvania
View on GitHub
🗺🧱🕹 Project-oriented, tileset- and object-based level editor for games! -- https://hlorenzi.github.io/mapvania/
☆27May 3, 2025Updated last year
lucidrains / scaling-vin-pytorch
View on GitHub
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
☆37Sep 23, 2024Updated last year
bwfbowen / muax
View on GitHub
A project that provides help for using DeepMind's mctx on gym-style environments.
☆66Nov 14, 2024Updated last year
btholt / complete-intro-to-react-v1
View on GitHub
The first version of the complete intro React, complete with Redux react-router v1, Mocha, and Webpack v1
☆21Feb 10, 2022Updated 4 years ago
sail-sg / rosmo
View on GitHub
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30Jul 18, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tsinghua-fib-lab / SmartAgent
View on GitHub
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27Aug 20, 2025Updated 11 months ago
chongminggao / DORL-codes
View on GitHub
Source codes for our SIGIR '23 paper
☆32Dec 14, 2023Updated 2 years ago
GeWu-Lab / MS-Bot
View on GitHub
The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
☆22Jun 25, 2025Updated last year
strin / curriculum-deep-RL
View on GitHub
Design good curriculums for deep reinforcement learning
☆14May 18, 2016Updated 10 years ago
elle-miller / roto
View on GitHub
RoTO is an open-source Reinforcement Learning benchmark environment designed to standardise and promote future research in tactile-based …
☆43Updated this week
YeWR / RLFP
View on GitHub
RLFP (CoRL 2024)
☆14Oct 11, 2024Updated last year
Alexander-Nasuta / graph-jsp-env
View on GitHub
A Gymnasium Environment for the Job Shop Problem Using the Disjunctive Graph Approach.
☆29May 4, 2026Updated 2 months ago