timoklein / redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
☆28Updated 9 months ago
Alternatives and similar repositories for redo
Users that are interested in redo are comparing it to the libraries listed below
- Author's PyTorch implementation of TD7 for online and offline RL☆146Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆104Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆104Updated last year
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆179Updated 2 months ago
- ☆49Updated 3 weeks ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆30Updated 2 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- ☆103Updated 5 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- Conservative Q learning in Jax☆54Updated 2 years ago
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆69Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆26Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated 2 years ago
- ☆279Updated 3 years ago
- A benchmark for offline goal-conditioned RL and offline RL☆214Updated last month
- Benchmarked implementations of Offline RL Algorithms.☆75Updated 5 months ago
- Synthetic Experience Replay☆96Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- Simple maze environments using mujoco-py☆57Updated last year
- A PyTorch implementation of Implicit Q-Learning☆83Updated 3 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆19Updated last year
- ☆15Updated 2 years ago
- Transformer-based World Models☆85Updated 2 years ago
- Official implementation of the BRO algorithm☆48Updated 6 months ago
- Benchmarking RL generalization in an interpretable way.☆159Updated last month
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆86Updated 8 months ago
- Representation Learning for RL☆126Updated 2 years ago
- Conservative Q Learning on top of SAC☆132Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning" contains scripts to reproduce experiments.☆129Updated 3 years ago
- Skeleton for scalable and flexible Jax RL implementations☆84Updated 2 years ago