Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆27May 2, 2025Updated last year
Alternatives and similar repositories for mctx-az
Users that are interested in mctx-az are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Classic MCTS example with mctx☆25May 25, 2023Updated 3 years ago
- ☆15Jul 9, 2024Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆66Nov 14, 2024Updated last year
- ♟️ Vectorized RL game environments in JAX☆617Mar 6, 2025Updated last year
- AlphaZero in JAX☆82Apr 3, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An exploration of LLM steering☆26Jun 15, 2024Updated 2 years ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆28Nov 15, 2018Updated 7 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆16May 19, 2023Updated 3 years ago
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Nov 4, 2025Updated 7 months ago
- ☆55Apr 11, 2023Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Flexible Inference for Predictive Coding Networks in JAX.☆83May 29, 2026Updated 2 weeks ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 11 months ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆41Dec 21, 2025Updated 5 months ago
- UNet script, model, sample data☆14Feb 19, 2025Updated last year
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,607Updated this week
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Implementation and evaluation of the AXIOM architecture from the preprint "AXIOM: Learning to Play Games in Minutes with Expanding Object…☆77Jun 2, 2025Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A programming language that deduces code from tests☆30Jan 8, 2018Updated 8 years ago
- PEP 503 repository index for jax[cuda]☆21Jan 14, 2025Updated last year
- ☆10May 1, 2023Updated 3 years ago
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆22Dec 7, 2024Updated last year
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- Personal reading list for learning-based long-horizon goal reaching methods☆17Nov 26, 2020Updated 5 years ago
- What's the simplest Turing Machine with unknown behavior?☆13Jun 18, 2016Updated 9 years ago
- Computer go engine using Monte-Carlo Tree Search written in Python3.☆71Sep 2, 2025Updated 9 months ago
- Implementations of the renormalization group-based diffusion model (RGDM).☆15Mar 10, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆53Apr 1, 2024Updated 2 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 8 years ago
- RL agent for Hollow Knight: Silksong boss fights☆54Dec 28, 2025Updated 5 months ago
- Pytorch Implementation of MuZero☆355Jul 23, 2023Updated 2 years ago
- Danish National Championship in AI 2025☆23Aug 4, 2025Updated 10 months ago
- PyTorch implementation of Optimistic Adam proposed in Training GANs with Optimism (https://arxiv.org/pdf/1711.00141.pdf)☆20Jan 16, 2021Updated 5 years ago
- ☆15Oct 25, 2023Updated 2 years ago