Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆26May 2, 2025Updated 10 months ago
Alternatives and similar repositories for mctx-az
Users that are interested in mctx-az are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- ☆13Jul 9, 2024Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆65Nov 14, 2024Updated last year
- ♟️ Vectorized RL game environments in JAX☆595Mar 6, 2025Updated last year
- ☆19Jan 16, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- fast + parallel AlphaZero in PyTorch☆15Jan 21, 2024Updated 2 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- AlphaZero in JAX☆81Apr 3, 2024Updated last year
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆28Nov 15, 2018Updated 7 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆125Feb 25, 2026Updated last month
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Nov 4, 2025Updated 4 months ago
- ☆54Apr 11, 2023Updated 2 years ago
- Reinforcement learning with Equinox☆20Mar 4, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Code for the simulations in the neural Kalman filtering paper☆23Jul 13, 2021Updated 4 years ago
- Flexible Inference for Predictive Coding Networks in JAX.☆77Mar 9, 2026Updated 2 weeks ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 6 months ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 9 months ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 3 months ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13Jul 12, 2024Updated last year
- UNet script, model, sample data☆14Feb 19, 2025Updated last year
- All the tools that allow me to never ever open up Final Cut☆11Feb 16, 2025Updated last year
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Website of pear launcher☆10Mar 19, 2024Updated 2 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- A programming language that deduces code from tests☆30Jan 8, 2018Updated 8 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆20Dec 7, 2024Updated last year
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- ☆21Jan 19, 2024Updated 2 years ago
- Personal reading list for learning-based long-horizon goal reaching methods☆17Nov 26, 2020Updated 5 years ago
- Computer go engine using Monte-Carlo Tree Search written in Python3.☆71Sep 2, 2025Updated 6 months ago
- [Unofficial Mirror] GDI++ ported to EasyHook☆12Aug 5, 2014Updated 11 years ago
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆49Apr 1, 2024Updated last year