lowrollr / mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆14Updated 8 months ago
Related projects: ⓘ
- fast + parallel AlphaZero in JAX☆80Updated 5 months ago
- ☆59Updated last month
- ☆56Updated 3 weeks ago
- ☆141Updated 2 weeks ago
- An implementation of MuZero in JAX.☆52Updated last year
- ☆46Updated last year
- General Modules for JAX☆57Updated last month
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆82Updated 9 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆10Updated 2 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆46Updated 5 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆87Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆102Updated 3 weeks ago
- Baselines for gymnax 🤖☆57Updated last year
- Efficient baselines for autocurricula in JAX.☆165Updated 3 weeks ago
- AlphaZero in JAX☆68Updated 5 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆189Updated 2 weeks ago
- Accelerated replay buffers in JAX☆39Updated 2 years ago
- An Open-Ended Agentic Simulator☆17Updated last month
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Scaling scaling laws with board games.☆36Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆27Updated last month
- Accelerated minigrid environments with JAX☆102Updated last month
- ☆25Updated this week
- Classic MCTS example with mctx☆15Updated last year
- ☆34Updated 2 years ago
- ☆17Updated 3 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆59Updated last month
- Standard interface for entity based reinforcement learning environments.☆35Updated 6 months ago
- Learning diverse options through the Laplacian representation.☆22Updated 8 months ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago