lowrollr / mctx-az

Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree

☆20

Alternatives and similar repositories for mctx-az

Users who are interested in mctx-az are comparing it to the libraries listed below.

lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97 Updated 6 months ago
Carbon225 / mctx-classic
Classic MCTS example with mctx
☆18 Updated 2 years ago
Reytuag / transformerXL_PPO_JAX
☆81 Updated 8 months ago
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60 Updated 8 months ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56 Updated 2 years ago
symoon11 / dreamerv3-flax
Flax Implementation of DreamerV3 on Crafter
☆16 Updated 4 months ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆67 Updated 2 years ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆173 Updated 3 months ago
epignatelli / navix
Accelerated minigrid environments with JAX
☆141 Updated last month
DramaCow / jaxued
☆82 Updated 3 months ago
hamishs / JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
☆23 Updated 5 months ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆107 Updated last year
luchris429 / JaxLife
An Open-Ended Agentic Simulator
☆51 Updated 11 months ago
tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆55 Updated 2 months ago
kenjyoung / mctx_learning_demo
☆52 Updated 2 years ago
instadeepai / flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆239 Updated 3 months ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114 Updated 10 months ago
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆58 Updated last year
danijar / ninjax
General Modules for JAX
☆66 Updated 3 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73 Updated 10 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41 Updated 2 years ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆145 Updated 2 years ago
RPegoud / jym
JAX implementation of RL algorithms and vectorized environments
☆47 Updated last year
MyNameIsArko / RL-Flax
Various reinforcement learning algorithms written in Jax + Flax
☆26 Updated 2 years ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56 Updated 2 years ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆101 Updated 8 months ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆57 Updated 2 years ago
keraJLi / rejax
Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!
☆230 Updated last month
twitter-research / hyperbolic-rl
☆55 Updated 2 years ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆41 Updated 2 years ago