vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆120 Updated last year
Alternatives and similar repositories for cleanba
Users who are interested in cleanba are comparing it to the libraries listed below.
- Baselines for gymnax ☆74 Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆59 Updated 3 years ago
- ☆91 Updated 3 months ago
- ☆89 Updated last year
- Accelerated replay buffers in JAX ☆46 Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆103 Updated 3 years ago
- General Modules for JAX ☆72 Updated 3 months ago
- JAX implementations of core Deep RL algorithms ☆82 Updated 3 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆74 Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents ☆107 Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆231 Updated last month
- A collection of RL algorithms written in JAX. ☆104 Updated 3 years ago
- An implementation of MuZero in JAX. ☆58 Updated 3 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆87 Updated 2 years ago
- The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆61 Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ☆98 Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆138 Updated last year
- Vectorization techniques for fast population-based training. ☆56 Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 Updated 2 years ago
- ☆46 Updated last year
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆85 Updated 2 years ago
- Accelerated minigrid environments with JAX ☆154 Updated 2 months ago
- Efficient baselines for autocurricula in JAX. ☆205 Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ☆160 Updated 2 years ago
- Corax: Core RL in JAX ☆38 Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆17 Updated 3 years ago
- ☆52 Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆62 Updated last week
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆82 Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆92 Updated 4 years ago