vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆105Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for cleanba
- ☆65Updated 3 weeks ago
- Evaluating long-term memory of reinforcement learning algorithms☆133Updated last year
- ☆63Updated 3 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆104Updated 3 months ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆208Updated last month
- ☆149Updated this week
- Baselines for gymnax 🤖☆60Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆99Updated 2 years ago
- Accelerated replay buffers in JAX☆40Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- General Modules for JAX☆59Updated 3 months ago
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆105Updated 2 years ago
- Conservative Q learning in Jax☆51Updated last year
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- Deep Hierarchical Planning from Pixels☆90Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆80Updated last year
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆59Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 4 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆63Updated 3 months ago
- Benchmarking RL generalization in an interpretable way.☆133Updated 9 months ago
- ☆201Updated this week
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆78Updated 3 months ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- Goal-Conditioned Reinforcement Learning with JAX☆95Updated this week
- Corax: Core RL in JAX☆35Updated 9 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆49Updated last year
- An API conversion tool for popular external reinforcement learning environments☆139Updated last month