microsoft/IBAC-SNI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/IBAC-SNI)

microsoft / IBAC-SNI

Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck" by Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin and Katja Hofmann.

☆52

Alternatives and similar repositories for IBAC-SNI

Users that are interested in IBAC-SNI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maximilianigl / rl-iter
View on GitHub
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆11Jun 8, 2020Updated 6 years ago
RajGhugare19 / VE-principle-for-model-based-RL
View on GitHub
Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…
☆18Apr 13, 2021Updated 5 years ago
rraileanu / auto-drac
View on GitHub
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆104Mar 24, 2023Updated 3 years ago
pokaxpoka / netrand
View on GitHub
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
☆57Apr 27, 2020Updated 6 years ago
kaixin96 / mixreg
View on GitHub
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆34Oct 22, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lili-chen / SEER
View on GitHub
Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.
☆21Mar 5, 2021Updated 5 years ago
microsoft / MAMBA
View on GitHub
Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
snu-mllab / DCPG
View on GitHub
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
☆15Feb 20, 2023Updated 3 years ago
openai / coinrun
View on GitHub
Code for the paper "Quantifying Transfer in Reinforcement Learning"
☆405Oct 7, 2023Updated 2 years ago
rraileanu / idaac
View on GitHub
☆55Feb 28, 2024Updated 2 years ago
microsoft / Partner-app-development
View on GitHub
Samples for partner application development (OEM, MO, IHV) for Window
☆18Jun 12, 2023Updated 3 years ago
jesbu1 / carl
View on GitHub
Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings
☆14Nov 22, 2022Updated 3 years ago
hzm2016 / option-critic-pytorch
View on GitHub
☆15Nov 21, 2022Updated 3 years ago
microsoft / OpenMSFTL
View on GitHub
Research simulation toolkit for federated learning
☆13Nov 7, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TrentBrick / RewardConditionedUDRL
View on GitHub
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆19Mar 10, 2021Updated 5 years ago
jvmncs / ParamNoise
View on GitHub
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30Mar 14, 2019Updated 7 years ago
kaixin96 / rl-generalization-paper
View on GitHub
A list of papers regarding generalization in (deep) reinforcement learning
☆156Aug 12, 2023Updated 2 years ago
microsoft / TextNAS
View on GitHub
This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text R…
☆15Nov 28, 2022Updated 3 years ago
DRL-CASIA / Deep-Reinforcement-Learning
View on GitHub
☆18Jan 4, 2021Updated 5 years ago
microsoft / smart
View on GitHub
Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"
☆54Jan 26, 2024Updated 2 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
facebookresearch / icp-block-mdp
View on GitHub
Invariant Causal Prediction for Block MDPs
☆44Jun 11, 2020Updated 6 years ago
BillyWM / Infinite-Mario
View on GitHub
Tweaks to Flash ver. of Notch's Infinite Mario Bros: Working score, levels that stay after death, etc
☆12Jul 27, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tmoer / a0c
View on GitHub
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
microsoft / wmg_agent
View on GitHub
WMG agent
☆34Oct 3, 2023Updated 2 years ago
microsoft / cognitive-research-technologies-docs
View on GitHub
Documentation related to Microsoft Cognitive Research Technologies
☆21Oct 6, 2022Updated 3 years ago
zhougroup / IDAC
View on GitHub
Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
Bluedotdot2021 / PRML-book_review
View on GitHub
PRML Page-by-page配套资料，对PRML全书及各章节的review
☆17Apr 16, 2024Updated 2 years ago
ryoungj / ZO-L2L
View on GitHub
[ICLR'20] Learning to Learn by Zeroth-Order Oracle
☆14Feb 7, 2020Updated 6 years ago
microsoft / MWSS
View on GitHub
Early Detection of Fake News with Multi-source Weak Social Supervision
☆24Jun 12, 2023Updated 3 years ago
stanford-iris-lab / batch-exploration
View on GitHub
☆12Apr 25, 2022Updated 4 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Pandede / FDRNN
View on GitHub
Deep direct reinforcement learning for financial signal representation and trading
☆31Oct 7, 2020Updated 5 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
BorealisAI / pommerman-baseline
View on GitHub
Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆38May 9, 2019Updated 7 years ago
pokaxpoka / rad_procgen
View on GitHub
RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)
☆19Mar 29, 2021Updated 5 years ago
Cranial-XIX / metric-residual-network
View on GitHub
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
☆20Jan 11, 2023Updated 3 years ago
Lifelong-ML / LPG-FTW
View on GitHub
☆20Jun 14, 2022Updated 4 years ago
philipjball / ReadyPolicyOne
View on GitHub
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Jul 6, 2023Updated 3 years ago