instadeepai / sebulbaLinks
πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
β58Updated last year
Alternatives and similar repositories for sebulba
Users that are interested in sebulba are comparing it to the libraries listed below
Sorting:
- β82Updated 3 months ago
- β81Updated 8 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ114Updated 10 months ago
- General Modules for JAXβ66Updated 3 months ago
- Accelerated minigrid environments with JAXβ141Updated last month
- Accelerated replay buffers in JAXβ41Updated 2 years ago
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ239Updated 3 months ago
- A collection of matrix games in JAXβ11Updated 7 months ago
- Simple single-file baselines for Q-Learning in pure-GPU settingβ173Updated 3 months ago
- Baselines for gymnax π€β67Updated 2 years ago
- β19Updated 2 months ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ51Updated 2 years ago
- Vectorization techniques for fast population-based training.β56Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β55Updated 2 months ago
- An Open-Ended Agentic Simulatorβ51Updated 11 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learningβ73Updated 10 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β107Updated last year
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!β230Updated last month
- Simple JAX Graphics Library.β36Updated 8 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithmsβ85Updated 2 months ago
- JAX implementations of core Deep RL algorithmsβ82Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agentsβ101Updated 8 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)β11Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β57Updated 2 years ago
- A collection of RL algorithms written in JAX.β100Updated 3 years ago
- Corax: Core RL in JAXβ38Updated last year
- Evaluating long-term memory of reinforcement learning algorithmsβ145Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ18Updated 8 months ago
- β44Updated 9 months ago
- An implementation of MuZero in JAX.β56Updated 2 years ago