amiratag / InterpretationFragilityLinks

Interpretation of Neural Network is Fragile

☆36

Alternatives and similar repositories for InterpretationFragility

Users that are interested in InterpretationFragility are comparing it to the libraries listed below

Sorting:

xuanqing94 / BayesianDefense
Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network
☆62Updated 6 years ago
dtak / adversarial-robustness-public
Code for AAAI 2018 accepted paper: "Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing the…
☆55Updated 2 years ago
mndu / guided-feature-inversion
PyTorch code for KDD 18 paper: Towards Explanation of DNN-based Prediction with Guided Feature Inversion
☆21Updated 6 years ago
dcmoyer / inv-rep
Code for Invariant Rep. Without Adversaries (NIPS 2018)
☆35Updated 5 years ago
marcoancona / DASP
Keras implementation for DASP: Deep Approximate Shapley Propagation (ICML 2019)
☆61Updated 6 years ago
max-andr / provable-robustness-max-linear-regions
Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]
☆32Updated 5 years ago
csinva / hierarchical-dnn-interpretations
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
☆129Updated 3 years ago
max-andr / provably-robust-boosting
Provably Robust Boosted Decision Stumps and Trees against Adversarial Attacks [NeurIPS 2019]
☆50Updated 5 years ago
ZiangYan / deepdefense.pytorch
Implementation of our NeurIPS 2018 paper: Deep Defense: Training DNNs with Improved Adversarial Robustness
☆39Updated 6 years ago
kohpangwei / data-poisoning-release
☆32Updated 7 years ago
ftramer / MultiRobustness
Code for the paper "Adversarial Training and Robustness for Multiple Perturbations", NeurIPS 2019
☆47Updated 2 years ago
zzzace2000 / FIDO-saliency
Explaining Image Classifiers by Counterfactual Generation
☆28Updated 3 years ago
hendrycks / pre-training
Pre-Training Buys Better Robustness and Uncertainty Estimates (ICML 2019)
☆100Updated 3 years ago
davidstutz / disentangling-robustness-generalization
CVPR'19 experiments with (on-manifold) adversarial examples.
☆45Updated 5 years ago
lsgos / uncertainty-adversarial-paper
Code for the paper 'Understanding Measures of Uncertainty for Adversarial Example Detection'
☆61Updated 7 years ago
tonyduan / rs4a
Randomized Smoothing of All Shapes and Sizes (ICML 2020).
☆52Updated 5 years ago
rmrisforbidden / Fooling_Neural_Network-Interpretations
This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap…
☆22Updated 4 years ago
MadryLab / robust_representations
Code for "Learning Perceptually-Aligned Representations via Adversarial Robustness"
☆161Updated 5 years ago
laura-rieger / deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" ht…
☆128Updated 4 years ago
locuslab / projected_sinkhorn
☆88Updated last year
Hadisalman / smoothing-adversarial
Code for our NeurIPS 2019 *spotlight* "Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers"
☆227Updated 5 years ago
ruthcfong / perturb_explanations
Code for Fong and Vedaldi 2017, "Interpretable Explanations of Black Boxes by Meaningful Perturbation"
☆31Updated 5 years ago
P2333 / Max-Mahalanobis-Training
Max Mahalanobis Training (ICML 2018 + ICLR 2020)
☆90Updated 4 years ago
ruthcfong / net2vec
Code for Net2Vec: Quantifying and Explaining how Concepts are Encoded by Filters in Deep Neural Networks
☆31Updated 7 years ago
s-huu / TurningWeaknessIntoStrength
Official implementation for paper: A New Defense Against Adversarial Images: Turning a Weakness into a Strength
☆38Updated 5 years ago
MadryLab / robust-features-code
Code for "Robustness May Be at Odds with Accuracy"
☆91Updated 2 years ago
hendrycks / fooling
Code for the Adversarial Image Detectors and a Saliency Map
☆12Updated 8 years ago
MadryLab / constructed-datasets
Datasets for the paper "Adversarial Examples are not Bugs, They Are Features"
☆188Updated 4 years ago
cetmann / robustness-interpretability
Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, …
☆16Updated 6 years ago
rfeinman / detecting-adversarial-samples
Code for "Detecting Adversarial Samples from Artifacts" (Feinman et al., 2017)
☆111Updated 7 years ago