nico-bohlinger / RL-X
A framework for Reinforcement Learning research.
☆137 · Updated last month
Alternatives and similar repositories for RL-X:
Users who are interested in RL-X are comparing it to the libraries listed below.
- ☆190 · Updated 2 months ago
- Clean single-file implementation of offline RL algorithms in JAX ☆134 · Updated 2 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆66 · Updated 8 months ago
- Goal-Conditioned Reinforcement Learning with JAX ☆123 · Updated this week
- Author's PyTorch implementation of TD7 for online and offline RL ☆132 · Updated last year
- ☆215 · Updated 3 months ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms ☆401 · Updated this week
- Partially Observable Process Gym ☆178 · Updated 7 months ago
- Benchmarking RL generalization in an interpretable way. ☆144 · Updated 3 weeks ago
- ☆260 · Updated 2 years ago
- ☆15 · Updated last year
- ☆253 · Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆69 · Updated 8 months ago
- A Simplified Pytorch Version of the Dreamer Algorithm ☆120 · Updated last year
- Repo for Implicit Diffusion Q-Learning ☆104 · Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆101 · Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆138 · Updated 3 months ago
- ☆73 · Updated 4 months ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆135 · Updated 10 months ago
- A benchmark for offline goal-conditioned RL and offline RL ☆126 · Updated this week
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆85 · Updated 7 months ago
- ☆48 · Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm ☆216 · Updated last year
- Baselines for gymnax 🤖 ☆65 · Updated last year
- Skeleton for scalable and flexible Jax RL implementations ☆73 · Updated last year
- An API conversion tool for popular external reinforcement learning environments ☆152 · Updated last month
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆74 · Updated 2 years ago
- Jax/Flax Implementation of TD-MPC2 ☆58 · Updated this week
- JAX implementation of RL algorithms and vectorized environments ☆40 · Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆84 · Updated 2 years ago