instadeepai / sebulbaLinks
πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
β59Updated last year
Alternatives and similar repositories for sebulba
Users that are interested in sebulba are comparing it to the libraries listed below
Sorting:
- Accelerated replay buffers in JAXβ43Updated 3 years ago
- β84Updated 11 months ago
- β83Updated last month
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ253Updated 2 weeks ago
- Accelerated minigrid environments with JAXβ150Updated last month
- General Modules for JAXβ67Updated 3 weeks ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ115Updated last year
- A collection of matrix games in JAXβ12Updated 10 months ago
- Vectorization techniques for fast population-based training.β56Updated 3 years ago
- Simple single-file baselines for Q-Learning in pure-GPU settingβ184Updated 6 months ago
- Baselines for gymnax π€β72Updated 2 years ago
- JAX implementations of core Deep RL algorithmsβ82Updated 3 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ52Updated 2 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!β236Updated 4 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learningβ72Updated last year
- A collection of RL algorithms written in JAX.β104Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ20Updated 11 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β58Updated 3 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β58Updated 2 weeks ago
- Unified Implementations of Offline Reinforcement Learning Algorithmsβ111Updated 5 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β110Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"β50Updated last year
- An Open-Ended Agentic Simulatorβ52Updated last year
- β18Updated 4 months ago
- β48Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithmsβ147Updated 2 years ago
- An implementation of MuZero in JAX.β57Updated 2 years ago
- β45Updated last year
- Conservative Q learning in Jaxβ55Updated 2 years ago
- Corax: Core RL in JAXβ38Updated last year