Carbon225/mctx-classic

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Carbon225/mctx-classic)

Carbon225 / mctx-classic

Classic MCTS example with mctx

☆24

Alternatives and similar repositories for mctx-classic

Users that are interested in mctx-classic are comparing it to the libraries listed below

Sorting:

kenjyoung / mctx_learning_demo
View on GitHub
☆54Apr 11, 2023Updated 2 years ago
bwfbowen / muax
View on GitHub
A project that provides help for using DeepMind's mctx on gym-style environments.
☆65Nov 14, 2024Updated last year
hr0nix / omega
View on GitHub
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆43Sep 19, 2022Updated 3 years ago
Egiob / cfrx
View on GitHub
cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.
☆37Aug 8, 2024Updated last year
rystrauss / dopamax
View on GitHub
Reinforcement learning in pure JAX.
☆13Dec 24, 2025Updated 2 months ago
RyanNavillus / PPO-v3
View on GitHub
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆14May 19, 2023Updated 2 years ago
instadeepai / flashbax
View on GitHub
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆273Sep 22, 2025Updated 5 months ago
symoon11 / dreamerv3-flax
View on GitHub
Flax Implementation of DreamerV3 on Crafter
☆18Nov 29, 2025Updated 3 months ago
google-deepmind / nao_top10
View on GitHub
☆19Mar 1, 2023Updated 3 years ago
NTT123 / a0-jax
View on GitHub
AlphaZero in JAX
☆81Apr 3, 2024Updated last year
sotetsuk / pgx
View on GitHub
♟️ Vectorized RL game environments in JAX
☆591Mar 6, 2025Updated last year
sash-a / CleanRL.jl
View on GitHub
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆23Feb 15, 2025Updated last year
patrick-kidger / esm2quinox
View on GitHub
An implementation of ESM2 in Equinox+JAX
☆36Jun 5, 2025Updated 9 months ago
lowrollr / turbozero
View on GitHub
fast + parallel AlphaZero in JAX
☆109Dec 22, 2024Updated last year
AranKomat / Alpha-Transformer
View on GitHub
Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search
☆28Nov 15, 2018Updated 7 years ago
jeremiecoullon / jax-tqdm
View on GitHub
Add a tqdm progress bar to your JAX scans and loops.
☆124May 9, 2025Updated 10 months ago
evanatyourservice / psgd_jax
View on GitHub
Implementation of PSGD optimizer in JAX
☆35Dec 31, 2024Updated last year
DHDev0 / Stochastic-muzero
View on GitHub
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆77Dec 31, 2025Updated 2 months ago
CarperAI / Algorithm-Distillation-RLHF
View on GitHub
☆35Jan 29, 2023Updated 3 years ago
google-deepmind / eigengame
View on GitHub
Open source code for EigenGame.
☆35May 15, 2023Updated 2 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
adityauser / Quest-RL
View on GitHub
Analysing result obtained using quite different RL algorithm
☆13Sep 5, 2019Updated 6 years ago
mind-games-challenge / mindgames-starter-kit
View on GitHub
The official starter-kit for NeurIPS 2025 mind games competition
☆21Jul 27, 2025Updated 7 months ago
epignatelli / navix
View on GitHub
Accelerated minigrid environments with JAX
☆162Oct 20, 2025Updated 4 months ago
instadeepai / jumanji
View on GitHub
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
☆810Dec 1, 2025Updated 3 months ago
nirgreshler / bayesian-online-planning
View on GitHub
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13Jun 17, 2024Updated last year
OPR-Project / ITLP-Campus
View on GitHub
About The dataset was recorded on the Husky robotics platform on the university campus and consists of 5 tracks recorded at different tim…
☆11Mar 25, 2025Updated 11 months ago
Swynfel / rust-catan
View on GitHub
A Rust implementation for the board game Catan
☆12May 6, 2020Updated 5 years ago
joonyeol-sim / awesome-mapf
View on GitHub
☆15Jul 27, 2023Updated 2 years ago
PearLauncher / pearlauncher.github.io
View on GitHub
Website of pear launcher
☆10Mar 19, 2024Updated last year
jduquevan / advantage-alignment
View on GitHub
Advantage Alignment Algorithms (ICLR 2025 oral)
☆17Apr 7, 2025Updated 11 months ago
RPegoud / jym
View on GitHub
JAX implementation of RL algorithms and vectorized environments
☆51Dec 26, 2023Updated 2 years ago
aviborg / MonitorKNXRF
View on GitHub
This code monitors (or sniff) the radiosignals sent by Uponor KNX RF thermostats and sent to OpenHAB using the REST interface. A CC1101 c…
☆11Dec 2, 2022Updated 3 years ago
burchim / DreamerV3-PyTorch
View on GitHub
PyTorch implementation of DreamerV3, Mastering Diverse Domains through World Models.
☆10Feb 16, 2024Updated 2 years ago
autonomousvision / ppo.cpp
View on GitHub
☆20Dec 1, 2025Updated 3 months ago
TheDuckAI / prm
View on GitHub
☆12Jan 17, 2025Updated last year
MediatekAndroidDevelopers / android_kernel_vernee_apollo_lite
View on GitHub
Kernel Source for Vernee Apollo Lite & X
☆11Dec 29, 2017Updated 8 years ago
dschaurecker / bitepy
View on GitHub
A Battery Intraday Trading Engine, based on dynamic programming approximations, written in C++, wrapped for Python
☆35Feb 5, 2026Updated last month
JuliaPOMDP / TabularTDLearning.jl
View on GitHub
Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA
☆13Nov 16, 2025Updated 3 months ago