dais-ita / interpretability-papers
Papers on interpretable deep learning, for review
☆29Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for interpretability-papers
- ☆61Updated last year
- Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]☆31Updated 4 years ago
- Computing various norms/measures on over-parametrized neural networks☆49Updated 5 years ago
- Code for "Testing Robustness Against Unforeseen Adversaries"☆80Updated 3 months ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆85Updated 4 years ago
- ☆35Updated last year
- OD-test: A Less Biased Evaluation of Out-of-Distribution (Outlier) Detectors (PyTorch)☆62Updated last year
- Geometric Certifications of Neural Nets☆41Updated 2 years ago
- Code for Fong and Vedaldi 2017, "Interpretable Explanations of Black Boxes by Meaningful Perturbation"☆30Updated 5 years ago
- ☆26Updated 5 years ago
- Figures & code from the paper "Shortcut Learning in Deep Neural Networks" (Nature Machine Intelligence 2020)☆94Updated 2 years ago
- Code for Net2Vec: Quantifying and Explaining how Concepts are Encoded by Filters in Deep Neural Networks☆31Updated 6 years ago
- Official repository for "Bridging Adversarial Robustness and Gradient Interpretability".☆30Updated 5 years ago
- Data, code & materials from the paper "Generalisation in humans and deep neural networks" (NeurIPS 2018)☆95Updated last year
- Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, …☆16Updated 5 years ago
- Code for the paper 'Understanding Measures of Uncertainty for Adversarial Example Detection'☆57Updated 6 years ago
- Implementation of Information Dropout☆39Updated 7 years ago
- Interpreting neural networks via the STREAK algorithm (streaming weak submodular maximization)☆22Updated 7 years ago
- A way to achieve uniform confidence far away from the training data.☆36Updated 3 years ago
- ☆55Updated 4 years ago
- This repo provides code used in the paper "Predicting with High Correlation Features" (https://arxiv.org/abs/1910.00164):☆54Updated 4 years ago
- This repository is no longer maintained. Check☆82Updated 4 years ago
- Source code for paper Mroueh, Sercu, Rigotti, Padhi, dos Santos, "Sobolev Independence Criterion", NeurIPS 2019☆14Updated 5 months ago
- A pytorch implementation of our jacobian regularizer to encourage learning representations more robust to input perturbations.☆123Updated last year
- Release of CIFAR-10.1, a new test set for CIFAR-10.☆220Updated 4 years ago
- Pytorch Implementation of recent visual attribution methods for model interpretability☆145Updated 4 years ago
- Starter Kit for the NeurIPS 2019 Disentanglement Challenge☆33Updated 2 years ago
- Interpretation of Neural Network is Fragile☆36Updated 6 months ago
- Notebooks for reproducing the paper "Computer Vision with a Single (Robust) Classifier"☆128Updated 5 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆60Updated 3 years ago