instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆56Updated last year
Alternatives and similar repositories for sebulba:
Users that are interested in sebulba are comparing it to the libraries listed below
- A collection of matrix games in JAX☆9Updated 3 months ago
- ☆73Updated 3 months ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆72Updated 6 months ago
- General Modules for JAX☆64Updated this week
- An Open-Ended Agentic Simulator☆41Updated 6 months ago
- Baselines for gymnax 🤖☆65Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆138Updated 3 months ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆225Updated this week
- ☆18Updated last month
- Accelerated minigrid environments with JAX☆130Updated 7 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆48Updated 2 years ago
- Simple JAX Graphics Library.☆34Updated 3 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]