damon-demon / Black-Box-Defense
View external linksLinks

Robustify Black-Box Models (ICLR'22 - Spotlight)

☆24

Alternatives and similar repositories for Black-Box-Defense

Users that are interested in Black-Box-Defense are comparing it to the libraries listed below

Sorting:

xpf / Data-Efficient-Backdoor-Attacks
View on GitHub
Data-Efficient Backdoor Attacks
☆20Jun 15, 2022Updated 3 years ago
konpanousis / Adversarial-LWTA-AutoAttack
View on GitHub
☆12May 6, 2022Updated 3 years ago
Ekko-zn / IJCAI2022-Backdoor
View on GitHub
☆20May 6, 2022Updated 3 years ago
VITA-Group / Random-Shuffling-BackdoorDetect
View on GitHub
[NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…
☆21Oct 1, 2022Updated 3 years ago
AI-secure / Robustness-Against-Backdoor-Attacks
View on GitHub
RAB: Provable Robustness Against Backdoor Attacks
☆39Oct 3, 2023Updated 2 years ago
VITA-Group / Backdoor-LTH
View on GitHub
[CVPR 2022] "Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free" by Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu C…
☆27Oct 5, 2022Updated 3 years ago
aptsunny / Ensemble-One-Shot-NAS
View on GitHub
Automated neural architecture search algorithms implemented in PyTorch and Autogluon toolkit.
☆12Apr 17, 2020Updated 5 years ago
bboylyg / ABL
View on GitHub
Anti-Backdoor learning (NeurIPS 2021)
☆83Jul 20, 2023Updated 2 years ago
hannxu123 / fair_robust
View on GitHub
☆11Apr 27, 2022Updated 3 years ago
dwDavidxd / MIAT
View on GitHub
Improving Adversarial Robustness via Mutual Information Estimation
☆11Apr 2, 2024Updated last year
VITA-Group / Sparsity-Win-Robust-Generalization
View on GitHub
[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…
☆40Mar 20, 2022Updated 3 years ago
thinwayliu / Watermark-Vaccine
View on GitHub
The code for ECCV2022 (Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal)
☆44Oct 1, 2022Updated 3 years ago
OPTML-Group / ILM-VP
View on GitHub
[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zha…
☆53Sep 17, 2023Updated 2 years ago
Guneet-Dhillon / Stochastic-Activation-Pruning
View on GitHub
☆19Mar 5, 2018Updated 7 years ago
ShawnXYang / AccumulativeAttack
View on GitHub
☆19Jun 21, 2021Updated 4 years ago
sunblaze-ucb / REFIT
View on GitHub
☆27Oct 17, 2022Updated 3 years ago
yuplin2333 / representation-space-jailbreak
View on GitHub
Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…
☆23Jul 26, 2024Updated last year
YiZeng623 / I-BAU
View on GitHub
Official Implementation of ICLR 2022 paper, ``Adversarial Unlearning of Backdoors via Implicit Hypergradient''
☆53Nov 16, 2022Updated 3 years ago
pantheon5100 / DeACL
View on GitHub
This is the official implementation of the paper "Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness,"…
☆19Jul 15, 2024Updated last year
KaihuaTang / CiiV-Adversarial-Robustness.pytorch
View on GitHub
The official PyTorch Implementation of the Paper "Adversarial Visual Robustness by Causal Intervention"
☆18Oct 6, 2021Updated 4 years ago
ZhengyuZhao / Targeted-Transfer
View on GitHub
Simple yet effective targeted transferable attack (NeurIPS 2021)
☆51Nov 17, 2022Updated 3 years ago
shizhouxing / Fast-Certified-Robust-Training
View on GitHub
[NeurIPS 2021] Fast Certified Robust Training with Short Warmup
☆25Jun 7, 2025Updated 8 months ago
Gwinhen / PixelBackdoor
View on GitHub
This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning."
☆24Apr 5, 2022Updated 3 years ago
amiratag / neuronshapley
View on GitHub
Code for "Neuron Shapley: Discovering the Responsible Neurons"
☆27May 1, 2024Updated last year
SchwinnL / LLM_Embedding_Attack
View on GitHub
Code to conduct an embedding attack on LLMs
☆31Jan 10, 2025Updated last year
MadryLab / label-consistent-backdoor-code
View on GitHub
Code for "Label-Consistent Backdoor Attacks"
☆57Nov 22, 2020Updated 5 years ago
AI-secure / Meta-Nerual-Trojan-Detection
View on GitHub
☆68Sep 29, 2020Updated 5 years ago
nblt / TWA
View on GitHub
[ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions
☆27Feb 11, 2025Updated last year
jiaxiaojunQAQ / LAS-AT
View on GitHub
Code for LAS-AT: Adversarial Training with Learnable Attack Strategy (CVPR2022)
☆118Mar 30, 2022Updated 3 years ago
jiaxiaojunQAQ / FGSM-SDI
View on GitHub
Code for Boosting fast adversarial training with learnable adversarial initialization (TIP2022)
☆29Aug 22, 2023Updated 2 years ago
shizhouxing / Robustness-Verification-for-Transformers
View on GitHub
[ICLR 2020] Code for paper "Robustness Verification for Transformers"
☆27Nov 26, 2024Updated last year
amunn / msu-thesis
View on GitHub
Thesis Class for Michigan State University
☆38Jul 8, 2024Updated last year
inspire-group / proxy-distributions
View on GitHub
[ICLR 2022 official code] Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness?
☆29Mar 15, 2022Updated 3 years ago
sani903 / OpenAgentSafety
View on GitHub
A Framework for Evaluating AI Agent Safety in Realistic Environments
☆30Oct 2, 2025Updated 4 months ago
DequanWang / dent
View on GitHub
Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks
☆38May 25, 2021Updated 4 years ago
skmda37 / CartoonX
View on GitHub
Code repository for the ECCV 2022 (Oral) paper "Cartoon Explanations of Image Classifiers"
☆10Nov 24, 2025Updated 2 months ago
LUMIA-Group / PonderingLM
View on GitHub
Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"
☆24Jul 21, 2025Updated 6 months ago
liecn / cnli.me
View on GitHub
template for https://cnli.me
☆10Feb 27, 2025Updated 11 months ago
SALT-NLP / PopupAttack
View on GitHub
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
☆50Dec 23, 2024Updated last year

damon-demon / Black-Box-DefenseView external linksLinks

Alternatives and similar repositories for Black-Box-Defense

damon-demon / Black-Box-Defense
View external linksLinks