laura-rieger / deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584
β127Updated 4 years ago
Alternatives and similar repositories for deep-explanation-penalization:
Users that are interested in deep-explanation-penalization are comparing it to the libraries listed below
- Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" π§ (ICLR 2019)β128Updated 3 years ago
- Tools for training explainable models using attribution priors.β123Updated 4 years ago
- β109Updated 2 years ago
- β51Updated 4 years ago
- Towards Automatic Concept-based Explanationsβ159Updated 11 months ago
- Figures & code from the paper "Shortcut Learning in Deep Neural Networks" (Nature Machine Intelligence 2020)β96Updated 2 years ago
- β134Updated 5 years ago
- Causal Explanation (CXPlain) is a method for explaining the predictions of any machine-learning model.β130Updated 4 years ago
- Codebase for "Deep Learning for Case-based Reasoning through Prototypes: A Neural Network that Explains Its Predictions" (to appear in AAβ¦β74Updated 7 years ago
- reference implementation for "explanations can be manipulated and geometry is to blame"β36Updated 2 years ago
- β125Updated 3 years ago
- Rethinking Bias-Variance Trade-off for Generalization of Neural Networksβ49Updated 4 years ago
- code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks"β53Updated 3 years ago
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 forβ¦β25Updated 3 years ago
- Interpretation of Neural Network is Fragileβ36Updated 11 months ago
- Keras implementation for DASP: Deep Approximate Shapley Propagation (ICML 2019)β61Updated 5 years ago
- Toolkit for building machine learning models that generalize to unseen domains and are robust to privacy and other attacks.β175Updated last year
- Code for Fong and Vedaldi 2017, "Interpretable Explanations of Black Boxes by Meaningful Perturbation"β30Updated 5 years ago
- A lightweight implementation of removal-based explanations for ML models.β59Updated 3 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893β86Updated 5 years ago
- Implementation of Layerwise Relevance Propagation for heatmapping "deep" layersβ98Updated 6 years ago
- Algorithms for abstention, calibration and domain adaptation to label shift.β36Updated 4 years ago
- Original dataset release for CIFAR-10Hβ82Updated 4 years ago
- Code for ICML 2018 paper on "Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam" by Khan, Nielsen, Tangkaratt, Lin, β¦β113Updated 6 years ago
- β50Updated 2 years ago
- Explaining Image Classifiers by Counterfactual Generationβ28Updated 2 years ago
- Self-Explaining Neural Networksβ40Updated 5 years ago
- Self-Explaining Neural Networksβ13Updated last year
- Code/figures in Right for the Right Reasonsβ55Updated 4 years ago
- β62Updated 3 years ago