microsoft / IBAC-SNI
Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck" by Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin and Katja Hofmann.
☆51Updated 4 years ago
Alternatives and similar repositories for IBAC-SNI
Users that are interested in IBAC-SNI are comparing it to the libraries listed below
Sorting:
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆37Updated 5 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆83Updated 2 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Updated 2 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Updated last year
- ☆42Updated 4 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆53Updated 5 years ago
- Revisiting Rainbow☆74Updated 3 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆37Updated 5 years ago
- ☆85Updated 9 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Efficient Exploration via State Marginal Matching (2019)☆68Updated 5 years ago
- ☆41Updated 3 years ago
- ☆53Updated last year
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆33Updated 4 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- ☆83Updated 4 years ago
- ☆43Updated 6 years ago
- ☆16Updated 9 months ago
- ☆45Updated last year
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated last year
- ☆31Updated 5 years ago
- impact-driven-exploration☆131Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- ☆26Updated 2 years ago
- My Body Is A Cage☆40Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago