instadeepai / sebulba
πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
β57Updated last year
Alternatives and similar repositories for sebulba
Users that are interested in sebulba are comparing it to the libraries listed below
Sorting:
- β79Updated 6 months ago
- A collection of matrix games in JAXβ11Updated 5 months ago
- β77Updated last month
- Baselines for gymnax π€β66Updated 2 years ago
- Accelerated replay buffers in JAXβ41Updated 2 years ago
- General Modules for JAXβ65Updated last month
- β19Updated this week
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ112Updated 8 months ago
- Vectorization techniques for fast population-based training.β56Updated 2 years ago
- An Open-Ended Agentic Simulatorβ49Updated 9 months ago
- An implementation of MuZero in JAX.β56Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU settingβ161Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithmsβ71Updated 3 weeks ago
- JAX implementations of core Deep RL algorithmsβ79Updated 3 years ago
- A collection of RL algorithms written in JAX.β97Updated 2 years ago
- Accelerated minigrid environments with JAXβ135Updated this week
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ236Updated last month
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ52Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β55Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated 2 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!β205Updated last month
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papersβ51Updated 2 years ago
- Learning diverse options through the Laplacian representation.β23Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"β22Updated 3 weeks ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β50Updated last week
- Simple JAX Graphics Library.β36Updated 6 months ago
- Clean single-file implementation of offline RL algorithms in JAXβ144Updated 4 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ17Updated 6 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β100Updated last year
- Highly scalable 2D JAX physics engine.β56Updated 2 months ago