Classic MCTS example with mctx
☆25May 25, 2023Updated 2 years ago
Alternatives and similar repositories for mctx-classic
Users that are interested in mctx-classic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- ☆55Apr 11, 2023Updated 3 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆66Nov 14, 2024Updated last year
- An environment for learning formal mathematical reasoning from scratch☆71Aug 18, 2024Updated last year
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆279Sep 22, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- ♟️ Vectorized RL game environments in JAX☆603Mar 6, 2025Updated last year
- AlphaZero in JAX☆83Apr 3, 2024Updated 2 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 4 months ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆28Nov 15, 2018Updated 7 years ago
- Distributed deep learning cluster simulation environment and RL-GNN resource management implementations.☆14Feb 1, 2023Updated 3 years ago
- ☆19Mar 1, 2023Updated 3 years ago
- An implementation of ESM2 in Equinox+JAX☆36Apr 20, 2026Updated 2 weeks ago
- Add a tqdm progress bar to your JAX scans and loops.☆128May 9, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- X-elerated Learning and Resource Allocation for Optical Networks☆21Updated this week
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Python tools☆14Oct 22, 2023Updated 2 years ago
- Accelerated minigrid environments with JAX☆170Oct 20, 2025Updated 6 months ago
- ☆35Jan 29, 2023Updated 3 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆55Apr 21, 2023Updated 3 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 5 months ago
- Monte Carlo tree search in JAX☆2,619Sep 2, 2025Updated 8 months ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris☆22Mar 21, 2025Updated last year
- ☆70Nov 9, 2023Updated 2 years ago
- fast + parallel AlphaZero in PyTorch☆15Jan 21, 2024Updated 2 years ago
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆828Apr 13, 2026Updated 3 weeks ago
- ☆14Aug 15, 2024Updated last year
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- ☆19Jan 16, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Efficient Conway's Game of Life implemented in Python using NumPy.☆14May 1, 2024Updated 2 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 4 months ago
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆87Mar 14, 2025Updated last year
- Monte Carlo Tree Search with Reinforcement Learning for Motion Planning☆80Sep 23, 2020Updated 5 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆23Feb 15, 2025Updated last year