Classic MCTS example with mctx
☆24May 25, 2023Updated 2 years ago
Alternatives and similar repositories for mctx-classic
Users that are interested in mctx-classic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆54Apr 11, 2023Updated 3 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆65Nov 14, 2024Updated last year
- An environment for learning formal mathematical reasoning from scratch☆72Aug 18, 2024Updated last year
- FUSION is an open-source project aimed at revolutionizing networking through the simulation of advanced SD-EONs and AI-enhanced networks,…☆13Mar 19, 2026Updated 3 weeks ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆274Sep 22, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆37Mar 11, 2026Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- ♟️ Vectorized RL game environments in JAX☆597Mar 6, 2025Updated last year
- AlphaZero in JAX☆82Apr 3, 2024Updated 2 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 3 months ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆28Nov 15, 2018Updated 7 years ago
- Distributed deep learning cluster simulation environment and RL-GNN resource management implementations.☆14Feb 1, 2023Updated 3 years ago
- ☆19Mar 1, 2023Updated 3 years ago
- fast + parallel AlphaZero in JAX☆111Dec 22, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An implementation of ESM2 in Equinox+JAX☆36Jun 5, 2025Updated 10 months ago
- Add a tqdm progress bar to your JAX scans and loops.☆126May 9, 2025Updated 11 months ago
- X-elerated Learning and Resource Allocation for Optical Networks☆21Feb 24, 2026Updated last month
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Accelerated minigrid environments with JAX☆166Oct 20, 2025Updated 5 months ago
- ☆35Jan 29, 2023Updated 3 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆54Apr 21, 2023Updated 2 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python tools☆14Oct 22, 2023Updated 2 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 4 months ago
- Monte Carlo tree search in JAX☆2,608Sep 2, 2025Updated 7 months ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris☆22Mar 21, 2025Updated last year
- ☆70Nov 9, 2023Updated 2 years ago
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆824Mar 9, 2026Updated last month
- ☆12Jan 17, 2025Updated last year
- ☆14Aug 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- Efficient Conway's Game of Life implemented in Python using NumPy.☆14May 1, 2024Updated last year
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆77Dec 31, 2025Updated 3 months ago
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Monte Carlo Tree Search with Reinforcement Learning for Motion Planning☆81Sep 23, 2020Updated 5 years ago