IBM / model-sanitization

Code for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness", published at ICLR 2020.

☆27

Alternatives and similar repositories for model-sanitization

Users who are interested in model-sanitization are comparing it to the repositories listed below.

Sandy-Zeng / NPAttack
PyTorch implementation of NPAttack
☆12 Updated 5 years ago
ShawnXYang / AT_HE
☆35 Updated 5 years ago
kyleliang919 / Uncovering-the-Connections-BetweenAdversarial-Transferability-and-Knowledge-Transferability
Code for the ICML 2021 paper exploring the relationship between adversarial transferability and knowledge transferability.
☆17 Updated 3 years ago
VITA-Group / Trap-and-Replace-Backdoor-Defense
[NeurIPS'22] Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork. Haotao Wang, Junyuan Hong,…
☆15 Updated 2 years ago
machanic / MetaAdvDet
The official PyTorch implementation of the ACM MM 19 paper "MetaAdvDet: Towards Robust Detection of Evolving Adversarial Attacks"
☆11 Updated 4 years ago
ShawnXYang / AccumulativeAttack
☆19 Updated 4 years ago
AI-secure / Robustness-Against-Backdoor-Attacks
RAB: Provable Robustness Against Backdoor Attacks
☆39 Updated 2 years ago
VITA-Group / Random-Shuffling-BackdoorDetect
[NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…
☆20 Updated 3 years ago
VITA-Group / Backdoor-LTH
[CVPR 2022] "Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free" by Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu C…
☆27 Updated 3 years ago
goldblum / AdversariallyRobustDistillation
PyTorch implementation of Adversarially Robust Distillation (ARD)
☆59 Updated 6 years ago
liuchen11 / AdversaryLossLandscape
On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]
☆36 Updated 4 years ago
cassidylaidlaw / ReColorAdv
ReColorAdv and other attacks from the NeurIPS 2019 paper "Functional Adversarial Attacks"
☆38 Updated 3 years ago
Cold-Winter / Nattack
☆48 Updated 4 years ago
ash-aldujaili / blackbox-adv-examples-signhunter
A repository for the query-efficient black-box attack, SignHunter
☆23 Updated 5 years ago
tml-epfl / adv-training-corruptions
On the Effectiveness of Adversarial Training against Common Corruptions [UAI 2022]
☆30 Updated 3 years ago
UMBCvision / Universal-Litmus-Patterns
Official Repository for the CVPR 2020 paper "Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs"
☆44 Updated 2 years ago
cleverhans-lab / dataset-inference
[ICLR'21] Dataset Inference for Ownership Resolution in Machine Learning
☆32 Updated 3 years ago
jiaxiaojunQAQ / FGSM-SDI
Code for "Boosting Fast Adversarial Training with Learnable Adversarial Initialization" (TIP 2022)
☆29 Updated 2 years ago
zhuchen03 / ConvexPolytopePosioning
ConvexPolytopePosioning
☆37 Updated 5 years ago
locuslab / breaking-poisoned-classifier
Code for the paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"
☆26 Updated 3 years ago
val-iisc / GAMA-GAT
Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses, NeurIPS Spotlight 2020
☆27 Updated 5 years ago
TLMichael / Delusive-Adversary
[NeurIPS 2021] Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training
☆32 Updated 3 years ago
superrrpotato / Defending-Neural-Backdoors-via-Generative-Distribution-Modeling
Code for the authors' NeurIPS 2019 paper: https://arxiv.org/abs/1910.04749
☆34 Updated 5 years ago
DequanWang / dent
Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks
☆39 Updated 4 years ago
VITA-Group / Alleviate-Robust-Overfitting
[ICLR 2021] "Robust Overfitting may be mitigated by properly learned smoothening" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, Shiyu Chan…
☆48 Updated 4 years ago
THUYimingLi / Semi-supervised_Robust_Training
Code for semi-supervised robust training (SRT).
☆18 Updated 2 years ago
wangjksjtu / minmax-adv
Code for "Adversarial Attack Generation Empowered by Min-Max Optimization", NeurIPS 2021
☆19 Updated 4 years ago
DennisLiu2022 / Membership-Inference-Attacks-by-Exploiting-Loss-Trajectory
☆24 Updated 3 years ago
amiratag / neuronshapley
Code for "Neuron Shapley: Discovering the Responsible Neurons"
☆27 Updated last year
UMBCvision / SSL-Backdoor
Official implementation of the CVPR 2022 paper "Backdoor Attacks on Self-Supervised Learning".
☆76 Updated 2 years ago