Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆27 · May 2, 2025 · Updated last year
Alternatives and similar repositories for mctx-az
Users interested in mctx-az are comparing it to the libraries listed below.
- Classic MCTS example with mctx ☆25 · May 25, 2023 · Updated 3 years ago
- ☆15 · Jul 9, 2024 · Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆66 · Nov 14, 2024 · Updated last year
- ♟️ Vectorized RL game environments in JAX ☆607 · Mar 6, 2025 · Updated last year
- AlphaZero in JAX ☆82 · Apr 3, 2024 · Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆16 · May 19, 2023 · Updated 3 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆129 · May 9, 2026 · Updated 2 weeks ago
- ☆55 · Apr 11, 2023 · Updated 3 years ago
- Reinforcement learning with Equinox ☆20 · Mar 4, 2025 · Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆43 · Sep 19, 2022 · Updated 3 years ago
- Flexible Inference for Predictive Coding Networks in JAX. ☆82 · Apr 8, 2026 · Updated last month
- Tidy autoregressive inference in JAX ☆15 · Sep 1, 2025 · Updated 8 months ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms. ☆26 · Jun 17, 2025 · Updated 11 months ago
- Flax Implementation of DreamerV3 on Crafter ☆18 · Nov 29, 2025 · Updated 5 months ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 · Aug 14, 2022 · Updated 3 years ago
- PyTorch implementation of MuZero for gym environments. It supports any Discrete, Box, and Box2D configuration for the action space and obse… ☆19 · Jan 24, 2023 · Updated 3 years ago
- This is the code corresponding to our publication introducing ConvDecoder with physics-based regularization (CD+r) for MRI ☆10 · Feb 6, 2026 · Updated 3 months ago
- The official Python library for Formulaic ☆18 · Apr 25, 2024 · Updated 2 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆41 · Dec 21, 2025 · Updated 5 months ago
- ☆14 · Jul 12, 2024 · Updated last year
- UNet script, model, sample data ☆14 · Feb 19, 2025 · Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT… ☆1,591 · May 12, 2026 · Updated 2 weeks ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction]. ☆16 · Oct 21, 2023 · Updated 2 years ago
- Implementation and evaluation of the AXIOM architecture from the preprint "AXIOM: Learning to Play Games in Minutes with Expanding Object… ☆74 · Jun 2, 2025 · Updated 11 months ago
- A programming language that deduces code from tests ☆30 · Jan 8, 2018 · Updated 8 years ago
- PEP 503 repository index for jax[cuda] ☆21 · Jan 14, 2025 · Updated last year
- Custom triton kernels for training Karpathy's nanoGPT. ☆19 · Oct 21, 2024 · Updated last year
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement" ☆21 · Dec 7, 2024 · Updated last year
- AlphaZero for continuous control tasks ☆23 · Dec 7, 2022 · Updated 3 years ago
- ☆22 · Jan 19, 2024 · Updated 2 years ago
- What's the simplest Turing Machine with unknown behavior? ☆13 · Jun 18, 2016 · Updated 9 years ago
- Implementations of the renormalization group-based diffusion model (RGDM). ☆16 · Mar 10, 2025 · Updated last year
- ☆11 · Mar 3, 2025 · Updated last year
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym ☆22 · May 6, 2023 · Updated 3 years ago
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces ☆50 · Apr 1, 2024 · Updated 2 years ago
- Sequential Monte Carlo sampler for PyMC2 models. ☆13 · Apr 4, 2018 · Updated 8 years ago
- Functional algorithms - definitions and implementations ☆13 · Oct 17, 2025 · Updated 7 months ago
- lanmt ebm ☆12 · Jun 19, 2020 · Updated 5 years ago
- ☆11 · Jun 17, 2016 · Updated 9 years ago