microsoft / IBAC-SNILinks
Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck" by Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin and Katja Hofmann.
☆50Updated 5 years ago
Alternatives and similar repositories for IBAC-SNI
Users that are interested in IBAC-SNI are comparing it to the libraries listed below
Sorting:
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated 2 years ago
- ☆87Updated last year
- Revisiting Rainbow☆75Updated 4 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆127Updated 4 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆90Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆30Updated 5 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆83Updated 3 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 5 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆54Updated 5 years ago
- ☆43Updated 6 years ago
- ☆54Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆89Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated 2 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆33Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 3 years ago
- ☆43Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆112Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- ☆26Updated 2 years ago
- Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.☆113Updated 4 years ago
- ☆84Updated 4 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆150Updated 4 years ago
- Soft Actor-Critic☆153Updated 7 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆152Updated 2 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Updated 4 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆36Updated 5 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆38Updated 6 years ago