rmrisforbidden / Fooling_Neural_Network-Interpretations
This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our paper has been accepted to NeurIPS 2019.
☆22 · Updated 3 years ago
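The paper's core idea is "adversarial model manipulation": fine-tune a pretrained classifier with a combined objective, ordinary cross-entropy plus a penalty that pushes a saliency method's attributions toward an uninformative region, so that accuracy is preserved while the interpretation is fooled. The sketch below is a minimal illustration of that idea, not the repository's actual code: the Grad-CAM target layer, the center-region "location fooling" penalty, and the trade-off weight `lam` are all assumptions made for the example.

```python
# Illustrative sketch of adversarial model manipulation with a Grad-CAM
# "location fooling" penalty. Assumed values: layer4 as the target layer,
# a center-region penalty, and lam = 1.0; the repository's code may differ.
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

model = resnet18(weights="IMAGENET1K_V1")
feat_maps = {}

def hook(module, inputs, output):
    # Cache the last conv block's activations for Grad-CAM.
    feat_maps["a"] = output

model.layer4.register_forward_hook(hook)

def gradcam(logits, target):
    """Differentiable Grad-CAM heatmap for the target class."""
    score = logits.gather(1, target[:, None]).sum()
    grads = torch.autograd.grad(score, feat_maps["a"], create_graph=True)[0]
    weights = grads.mean(dim=(2, 3), keepdim=True)        # GAP of gradients
    cam = F.relu((weights * feat_maps["a"]).sum(dim=1))   # (B, H, W)
    return cam / (cam.amax(dim=(1, 2), keepdim=True) + 1e-8)

def fooling_loss(cam):
    """Penalize attribution mass in the image center (location fooling)."""
    h, w = cam.shape[1:]
    center = cam[:, h // 4 : 3 * h // 4, w // 4 : 3 * w // 4]
    return center.mean()

optimizer = torch.optim.SGD(model.parameters(), lr=1e-4)
lam = 1.0  # accuracy vs. fooling trade-off (assumed value)

def manipulation_step(images, labels):
    # Keep the classification loss low while shifting Grad-CAM away
    # from the center region.
    logits = model(images)
    cam = gradcam(logits, labels)
    loss = F.cross_entropy(logits, labels) + lam * fooling_loss(cam)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Because the fooling term is a function of gradients, the update is second-order; `create_graph=True` is what lets the penalty backpropagate into the model weights.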
Related projects
Alternatives and complementary repositories for Fooling_Neural_Network-Interpretations
- Reference implementation for "Explanations can be manipulated and geometry is to blame" ☆35 · Updated 2 years ago
- Code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks" ☆51 · Updated 2 years ago
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity of Explanations" (NeurIPS 2019) for… ☆25 · Updated 2 years ago
- Interpretation of Neural Networks is Fragile ☆36 · Updated 6 months ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] ☆30 · Updated 2 years ago
- Code for reproducing the experimental results in "Proper Network Interpretability Helps Adversarial Robustness in Classification", publi… ☆13 · Updated 4 years ago
- Understanding and Improving Fast Adversarial Training [NeurIPS 2020] ☆94 · Updated 3 years ago
- PyTorch implementations of adversarial defenses and utilities. ☆34 · Updated 10 months ago
- Pre-Training Buys Better Robustness and Uncertainty Estimates (ICML 2019) ☆99 · Updated 2 years ago
- Code for the paper "MMA Training: Direct Input Space Margin Maximization through Adversarial Training" ☆34 · Updated 4 years ago
- CVPR'19 experiments with (on-manifold) adversarial examples. ☆44 · Updated 4 years ago
- Code for the AAAI 2018 paper "Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing the… ☆54 · Updated last year
- PyTorch implementation of Adversarially Robust Distillation (ARD) ☆59 · Updated 5 years ago
- Detection of adversarial examples using influence functions and nearest neighbors ☆32 · Updated last year
- Semi-supervised learning for adversarial robustness (https://arxiv.org/pdf/1905.13736.pdf) ☆137 · Updated 4 years ago
- A way to achieve uniform confidence far away from the training data. ☆36 · Updated 3 years ago
- Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019] ☆31 · Updated 4 years ago
- Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks (ICCV 2019) ☆59 · Updated 5 years ago
- Code for the paper "Geometry-aware Instance-reweighted Adversarial Training" (ICLR 2021 oral) ☆57 · Updated 3 years ago
- Adversarial Defense for Ensemble Models (ICML 2019) ☆61 · Updated 3 years ago
- Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models". ☆54 · Updated 2 years ago
- ☆109 · Updated 2 years ago
- A PyTorch implementation of the method found in "Adversarially Robust Few-Shot Learning: A Meta-Learning Approach" ☆49 · Updated 4 years ago
- A Closer Look at Accuracy vs. Robustness ☆88 · Updated 3 years ago
- ☆15 · Updated 4 years ago
- Code for "Neuron Shapley: Discovering the Responsible Neurons" ☆23 · Updated 6 months ago
- Max Mahalanobis Training (ICML 2018 + ICLR 2020) ☆89 · Updated 3 years ago
- Original dataset release for CIFAR-10H ☆82 · Updated 4 years ago
- Code for the paper "Adversarial Training and Robustness for Multiple Perturbations" (NeurIPS 2019) ☆46 · Updated last year
- Source code for the paper "Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness" ☆26 · Updated 4 years ago