tuero/muzero-cpp

A C++ PyTorch implementation of MuZero

☆40

Alternatives and similar repositories for muzero-cpp

Users that are interested in muzero-cpp are comparing it to the libraries listed below.

DHDev0 / Stochastic-muzero
PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆79 · Dec 31, 2025 · Updated 6 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44 · Sep 19, 2022 · Updated 3 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆58 · Nov 8, 2022 · Updated 3 years ago
RyanNavillus / PPO-v3
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16 · May 19, 2023 · Updated 3 years ago
jianzhnie / RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
☆17 · Oct 15, 2024 · Updated last year
rlglab / optionzero
[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm
☆28 · May 18, 2025 · Updated last year
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆169 · Mar 28, 2021 · Updated 5 years ago
adepierre / Caffe_AlphaZero
Implementation of DeepMind's AlphaZero algorithm with Caffe and C++
☆20 · Apr 14, 2018 · Updated 8 years ago
rlglab / minizero
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆136 · Updated this week
fidel-schaposnik / muzero
TensorFlow implementation of the MuZero algorithm
☆11 · Aug 23, 2022 · Updated 3 years ago
xionghuichen / MAPLE
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
☆25 · Jan 16, 2024 · Updated 2 years ago
lucadellalib / actorch
Deep reinforcement learning framework for fast prototyping based on PyTorch
☆14 · Mar 12, 2023 · Updated 3 years ago
abhilash1910 / Deep_Reinforcement_Learning_Trading
Deep Reinforcement Learning for Trading
☆30 · Oct 10, 2022 · Updated 3 years ago
DHDev0 / Muzero-unplugged
PyTorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆36 · Jun 25, 2025 · Updated last year
robbycostales / HAL
Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)
☆14 · Mar 14, 2022 · Updated 4 years ago
YeWR / EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆938 · Dec 20, 2023 · Updated 2 years ago
peldszus / alpha-zero-general-lib
An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice
☆11 · Aug 30, 2020 · Updated 5 years ago
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28 · Mar 24, 2023 · Updated 3 years ago
levelupai / rl-slg
Reinforcement learning training project for an SLG game
☆13 · Dec 21, 2017 · Updated 8 years ago
danielwillemsen / PendulumDemo
Model-Based RL Demo for Pendulum-v0
☆13 · Jun 16, 2020 · Updated 6 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in JAX (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
cwher / RL-Sekiro
☆12 · Jun 30, 2022 · Updated 4 years ago
dlwh / jax_sourceror
Turn jitted JAX functions back into Python source code
☆23 · Dec 16, 2024 · Updated last year
lastland / ClairvoyanceMonad
The Coq formalization of the paper Reasoning about the garden of forking paths.
☆25 · Feb 7, 2025 · Updated last year
typoverflow / WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21 · Mar 24, 2025 · Updated last year
entity-neural-network / entity-gym
Standard interface for entity based reinforcement learning environments.
☆39 · Feb 28, 2024 · Updated 2 years ago
sugiyama404 / ReinfoceLearningForTrading
☆13 · Mar 31, 2024 · Updated 2 years ago
paulnovello / HSIC-Attribution-Method
☆15 · Jan 2, 2023 · Updated 3 years ago
jiangsy / slbo_pytorch
☆15 · Sep 14, 2020 · Updated 5 years ago
ctallec / continuous-rl
☆20 · Apr 29, 2019 · Updated 7 years ago
rangl-labs / netzerotc
☆11 · Jul 15, 2022 · Updated 4 years ago
maxlamberti / optimal-order-execution
Exploring Optimal Order Execution in Simulated Limit Order Books
☆20 · Dec 8, 2022 · Updated 3 years ago
werner-duvaud / muzero-general
MuZero
☆2,844 · Sep 3, 2024 · Updated last year
ucla-rlcourse / competitive-rl
A set of competitive environments for Reinforcement Learning research.
☆31 · Dec 1, 2022 · Updated 3 years ago
kevinzakka / dm_env_wrappers
Standalone library of frequently-used wrappers for dm_env environments.
☆19 · Jul 9, 2024 · Updated 2 years ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆78 · Apr 3, 2023 · Updated 3 years ago
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in JAX + Flax
☆21 · May 18, 2023 · Updated 3 years ago
NVlabs / cule
CuLE: A CUDA port of the Atari Learning Environment (ALE)
☆243 · Nov 21, 2022 · Updated 3 years ago
timeolord / Reinforcement-Learning-Stock-Trader
Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…
☆20 · Jun 24, 2026 · Updated 3 weeks ago