instadeepai / sebulba
The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
★59 Updated last year
Alternatives and similar repositories for sebulba
Users that are interested in sebulba are comparing it to the libraries listed below
- ★83 Updated 10 months ago
- Accelerated replay buffers in JAX ★43 Updated 3 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ★251 Updated last month
- A collection of matrix games in JAX ★12 Updated 9 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ★114 Updated last year
- Baselines for gymnax ★71 Updated 2 years ago
- ★83 Updated last week
- Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch ★52 Updated 2 years ago
- General Modules for JAX ★67 Updated last week
- Accelerated minigrid environments with JAX ★147 Updated 2 weeks ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ★20 Updated 10 months ago
- Vectorization techniques for fast population-based training. ★56 Updated 3 years ago
- ★18 Updated 4 months ago
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ★58 Updated 3 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ★182 Updated 6 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ★72 Updated last year
- JAX implementations of core Deep RL algorithms ★82 Updated 3 years ago
- A collection of RL algorithms written in JAX. ★104 Updated 3 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ★28 Updated 4 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ★56 Updated last week
- Hardware-Accelerated Reinforcement Learning Algorithms in pure JAX! ★234 Updated 3 months ago
- An Open-Ended Agentic Simulator ★52 Updated last year
- Corax: Core RL in JAX ★38 Updated last year
- A tool for aggregating and plotting MARL experiment data. ★77 Updated 8 months ago
- ★46 Updated 2 years ago
- Simple JAX Graphics Library. ★36 Updated 10 months ago
- Conservative Q-learning in JAX ★55 Updated 2 years ago
- Unified Implementations of Offline Reinforcement Learning Algorithms ★94 Updated 4 months ago
- Evaluating long-term memory of reinforcement learning algorithms ★148 Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ★110 Updated last year