timoklein / redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
☆22 · Updated 2 months ago
Related projects:
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆87 · Updated 3 months ago
- ☆12 · Updated 4 months ago
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆68 · Updated last month
- Author's PyTorch implementation of TD7 for online and offline RL ☆108 · Updated last year
- A PyTorch implementation of Implicit Q-Learning ☆66 · Updated 2 years ago
- Simple maze environments using mujoco-py ☆52 · Updated 8 months ago
- PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch. ☆31 · Updated 4 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆28 · Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆108 · Updated 2 years ago
- Conservative Q learning in Jax ☆49 · Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆71 · Updated 9 months ago
- Conservative Q Learning on top of SAC ☆118 · Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆41 · Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆53 · Updated 3 months ago
- ☆46 · Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆16 · Updated 8 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆68 · Updated 2 years ago
- ☆51 · Updated last year
- Transformer-based World Models ☆66 · Updated last year
- Synthetic Experience Replay ☆62 · Updated 3 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆96 · Updated 2 years ago
- Representation Learning for RL ☆110 · Updated last year
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code ☆15 · Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆100 · Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 · Updated last year
- ☆43 · Updated 3 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆14 · Updated 5 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆58 · Updated last year
- Deep Hierarchical Planning from Pixels ☆85 · Updated last year
- PyTorch version of Dreamer, following the original TF v2 code. ☆112 · Updated 2 years ago