nico-bohlinger / RL-X
A framework for Reinforcement Learning research.
☆98 Updated 2 weeks ago
Related projects:
- ☆192 Updated 7 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆53 Updated 3 months ago
- Benchmarking RL generalization in an interpretable way. ☆128 Updated 7 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆96 Updated 2 years ago
- Partially Observable Process Gym ☆158 Updated 2 months ago
- ☆141 Updated 2 weeks ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆102 Updated 3 weeks ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆66 Updated 9 months ago
- Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning" ☆68 Updated last month
- A Simplified PyTorch Version of the Dreamer Algorithm ☆105 Updated last year
- ☆15 Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆101 Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning. ☆125 Updated this week
- Extreme Q-Learning: Max Entropy RL without Entropy ☆78 Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆189 Updated 2 weeks ago
- Prioritized Experience Replay implementation with proportional prioritization ☆67 Updated last year
- An API conversion tool for popular external reinforcement learning environments ☆131 Updated 3 months ago
- Skeleton for scalable and flexible JAX RL implementations ☆58 Updated last year
- Clean single-file implementation of offline RL algorithms in JAX ☆86 Updated last month
- PyTorch version of Dreamer, following the original TF v2 code. ☆112 Updated 2 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆195 Updated 3 weeks ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆76 Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ☆129 Updated last year
- Synthetic Experience Replay ☆62 Updated 3 months ago
- ☆226 Updated 2 years ago
- Baselines for gymnax 🤖 ☆57 Updated last year
- ☆56 Updated 3 weeks ago
- ☆201 Updated last year
- Simple maze environments using mujoco-py ☆52 Updated 8 months ago
- Deep Hierarchical Planning from Pixels ☆85 Updated last year