maximilianigl / rl-iter
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆12Updated 4 years ago
Related projects: ⓘ
- ☆53Updated 6 months ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆36Updated 4 years ago
- ☆41Updated 3 years ago
- ☆23Updated last year
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 5 years ago
- ☆41Updated 5 years ago
- ☆29Updated 3 years ago
- ☆59Updated 6 years ago
- ☆35Updated 2 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"☆17Updated last year
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆45Updated last year
- ☆17Updated 2 years ago
- ☆52Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆141Updated last year
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- Simple maze environments using mujoco-py☆52Updated 8 months ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆142Updated 3 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆15Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- ☆26Updated 5 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)☆22Updated 2 years ago
- Hindsight policy gradients☆42Updated 4 years ago
- ☆13Updated last year
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- ☆95Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- ☆107Updated last year