Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck" by Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin and Katja Hofmann.
☆52 · Jun 28, 2020 · Updated 5 years ago
Alternatives and similar repositories for IBAC-SNI
Users interested in IBAC-SNI are comparing it to the libraries listed below.
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning ☆11 · Jun 8, 2020 · Updated 5 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC) ☆103 · Mar 24, 2023 · Updated 2 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020 ☆56 · Apr 27, 2020 · Updated 5 years ago
- Repository for ML Reproducibility Challenge 2020 for the NeurIPS paper, "The Value Equivalence Principle for Model-Based Reinforcement Le… ☆18 · Apr 13, 2021 · Updated 4 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization ☆34 · Oct 22, 2020 · Updated 5 years ago
- An implementation of effective policy ensemble. ☆16 · Jul 5, 2023 · Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 · Mar 5, 2021 · Updated 5 years ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022) ☆14 · Feb 20, 2023 · Updated 3 years ago
- Imitation learning from multiple experts ☆13 · Aug 29, 2022 · Updated 3 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning" ☆409 · Oct 7, 2023 · Updated 2 years ago
- ☆55 · Feb 28, 2024 · Updated 2 years ago
- Samples for partner application development (OEM, MO, IHV) for Window ☆18 · Jun 12, 2023 · Updated 2 years ago
- GitHub repo for CARL: Cautious Adaptation for RL in Safety Critical Settings ☆14 · Nov 22, 2022 · Updated 3 years ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis" ☆18 · Jul 20, 2023 · Updated 2 years ago
- Research simulation toolkit for federated learning ☆13 · Nov 7, 2020 · Updated 5 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies ☆19 · Mar 10, 2021 · Updated 5 years ago
- Dynamic Measurement Scheduling for Event Forecasting using Deep RL (ICML 2019) ☆10 · Jun 16, 2020 · Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆30 · Mar 14, 2019 · Updated 7 years ago
- ☆14 · Nov 21, 2022 · Updated 3 years ago
- This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text R… ☆15 · Nov 28, 2022 · Updated 3 years ago
- ☆18 · Jan 4, 2021 · Updated 5 years ago
- PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction" ☆11 · Oct 29, 2019 · Updated 6 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers" ☆54 · Jan 26, 2024 · Updated 2 years ago
- Invariant Causal Prediction for Block MDPs ☆44 · Jun 11, 2020 · Updated 5 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf) ☆15 · Jan 19, 2021 · Updated 5 years ago
- Tweaks to Flash ver. of Notch's Infinite Mario Bros: Working score, levels that stay after death, etc ☆12 · Jul 27, 2021 · Updated 4 years ago
- WMG agent ☆34 · Oct 3, 2023 · Updated 2 years ago
- Implicit Distributional Actor Critic ☆11 · Dec 8, 2021 · Updated 4 years ago
- Documentation related to Microsoft Cognitive Research Technologies ☆21 · Oct 6, 2022 · Updated 3 years ago
- PRML page-by-page companion materials: a review of the whole PRML book and each of its chapters ☆17 · Apr 16, 2024 · Updated last year
- Early Detection of Fake News with Multi-source Weak Social Supervision ☆23 · Jun 12, 2023 · Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- ☆12 · Apr 25, 2022 · Updated 3 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing" ☆24 · Feb 15, 2023 · Updated 3 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition" ☆37 · May 9, 2019 · Updated 6 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆37 · Dec 7, 2020 · Updated 5 years ago
- ☆13 · Jul 25, 2023 · Updated 2 years ago
- Deep direct reinforcement learning for financial signal representation and trading ☆32 · Oct 7, 2020 · Updated 5 years ago
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle ☆14 · Feb 7, 2020 · Updated 6 years ago