vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
⭐117 · Updated last year
Alternatives and similar repositories for cleanba
Users who are interested in cleanba are comparing it to the libraries listed below.
- Baselines for gymnax 🚀 ⭐72 · Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ⭐59 · Updated 3 years ago
- Accelerated replay buffers in JAX ⭐44 · Updated 3 years ago
- ⭐87 · Updated 2 months ago
- ⭐87 · Updated last year
- General Modules for JAX ⭐71 · Updated 2 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ⭐104 · Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ⭐136 · Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ⭐103 · Updated 3 years ago
- An implementation of MuZero in JAX. ⭐57 · Updated 3 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ⭐87 · Updated 2 years ago
- ⭐46 · Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning ⭐73 · Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ⭐151 · Updated 2 years ago
- Vectorization techniques for fast population-based training. ⭐56 · Updated 3 years ago
- JAX implementations of core Deep RL algorithms ⭐82 · Updated 3 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ⭐212 · Updated last week
- A collection of RL algorithms written in JAX. ⭐104 · Updated 3 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google) ⭐127 · Updated last year
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ⭐60 · Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ⭐110 · Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ⭐98 · Updated 2 years ago
- Accelerated minigrid environments with JAX ⭐152 · Updated last month
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ⭐61 · Updated last month
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ⭐84 · Updated last year
- Corax: Core RL in JAX ⭐38 · Updated last year
- Conservative Q learning in Jax ⭐55 · Updated 2 years ago
- ⭐241 · Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ⭐117 · Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ⭐57 · Updated last year