instadeepai / sebulbaLinks
πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
β57Updated last year
Alternatives and similar repositories for sebulba
Users that are interested in sebulba are comparing it to the libraries listed below
Sorting:
- A collection of matrix games in JAXβ11Updated 6 months ago
- β80Updated 7 months ago
- β79Updated 2 months ago
- Accelerated replay buffers in JAXβ41Updated 2 years ago
- General Modules for JAXβ66Updated 2 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β53Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU settingβ167Updated 2 months ago
- JAX implementations of core Deep RL algorithmsβ79Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ112Updated 9 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papersβ52Updated 2 years ago
- Accelerated minigrid environments with JAXβ138Updated 3 weeks ago
- Baselines for gymnax π€β66Updated 2 years ago
- Corax: Core RL in JAXβ38Updated last year
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ236Updated 2 months ago
- An Open-Ended Agentic Simulatorβ51Updated 9 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"β22Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithmsβ80Updated last month
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ52Updated 2 years ago
- β19Updated 3 weeks ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.β77Updated 4 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)β10Updated last year
- A collection of RL algorithms written in JAX.β98Updated 2 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!β219Updated last week
- Vectorization techniques for fast population-based training.β56Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β101Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β56Updated 2 years ago
- An implementation of MuZero in JAX.β56Updated 2 years ago
- Simple JAX Graphics Library.β36Updated 7 months ago
- Learning diverse options through the Laplacian representation.β23Updated last year