[ICLR2025] Detecting Backdoor Samples in Contrastive Language Image Pretraining
☆19, updated Feb 26, 2025
Alternatives and similar repositories for Detect-CLIP-Backdoor-Samples
Users that are interested in Detect-CLIP-Backdoor-Samples are comparing it to the libraries listed below
- This work corroborates a run-time Trojan detection method exploiting STRong Intentional Perturbation of inputs; it is a multi-domain Trojan … (☆10, updated Mar 7, 2021)
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning (☆17, updated Oct 13, 2025)
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening (☆10, updated Dec 18, 2025)
- The official implementation of the CVPR 2025 paper "Invisible Backdoor Attack against Self-supervised Learning" (☆17, updated Jul 5, 2025)
- Code repository for the paper [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples (☆30, updated Jul 11, 2023)
- A minimal PyTorch implementation of Label-Consistent Backdoor Attacks (☆29, updated Feb 8, 2021)
- [ICLR2023] Distilling Cognitive Backdoor Patterns within an Image (☆36, updated Oct 29, 2025)
- Official implementation of "Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection" (ICLR 2024) (☆18, updated Apr 15, 2024)
- [NeurIPS 2025 D&B] BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model (☆24, updated Aug 1, 2025)
- ☆18, updated Aug 15, 2022
- [NeurIPS2023] Black-box Backdoor Defense via Zero-shot Image Purification (☆16, updated Oct 31, 2023)
- Source code for the paper "Energy-Latency Attacks via Sponge Poisoning" (☆15, updated Mar 14, 2022)
- [EMNLP 24] Official implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models (☆19, updated Mar 9, 2025)
- Code for the paper "PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models" (IEEE ICASSP 2024). Demo//124.220.228.133:11107 (☆20, updated Aug 10, 2024)
- Code repository for the paper "Revisiting the Assumption of Latent Separability for Backdoor Defenses" (ICLR 2023) (☆47, updated Feb 28, 2023)
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-… (☆44, updated Jul 26, 2021)
- Defending Against Backdoor Attacks Using Robust Covariance Estimation (☆22, updated Jul 12, 2021)
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models (☆19, updated Feb 18, 2025)
- [NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh… (☆21, updated Oct 1, 2022)
- Official code for the ICCV2023 paper "One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training" (☆20, updated Aug 9, 2023)
- Attack-Resilient Image Watermarking Using Stable Diffusion (NeurIPS2024) (☆57, updated Dec 5, 2024)
- ☆27, updated Jan 23, 2024
- Code repo for the NeurIPS 2023 paper "VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models" (☆27, updated Sep 18, 2025)
- [ICLR 2025] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks (☆30, updated Nov 2, 2025)
- ☆59, updated Jun 5, 2024
- Universal Adversarial Perturbations (UAPs) for PyTorch (☆49, updated Aug 28, 2021)
- Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021) (☆25, updated Oct 21, 2021)
- An Embarrassingly Simple Backdoor Attack on Self-supervised Learning (☆20, updated Jan 24, 2024)
- Official implementation of the CVPR23 paper "Backdoor Defense via Deconfounded Representation Learning" (☆25, updated Mar 13, 2023)
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021) (☆24, updated Dec 9, 2021)
- ☆26, updated Dec 1, 2022
- Removing Adversarial Noise in Class Activation Feature Space (☆14, updated Oct 12, 2023)
- ☆30, updated Sep 3, 2024
- ☆25, updated Jun 23, 2021
- [BMVC 2023] Semantic Adversarial Attacks via Diffusion Models (☆24, updated Nov 30, 2023)
- ECCV2024: Adversarial Prompt Tuning for Vision-Language Models (☆31, updated Nov 19, 2024)
- Source code for MEA-Defender; the paper was accepted at the IEEE Symposium on Security and Privacy (S&P) 2024 (☆29, updated Nov 19, 2023)
- Repository for the NeurIPS 2018 spotlight paper "Attacks Meet Interpretability: Attribute-steered Detection of Adversarial Samples" (☆31, updated Apr 27, 2022)
- Official repository of the paper "Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code" (Findings of EACL … (☆12, updated Feb 11, 2026)
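Several entries above are run-time detection defenses; STRIP, for instance, superimposes a candidate input with held-out clean samples and flags inputs whose predictions remain abnormally confident (low entropy), since a strong backdoor trigger dominates the output regardless of perturbation. A minimal sketch of that entropy test, where `toy_model`, the trigger pixel, and all thresholds are hypothetical stand-ins rather than any repository's actual code:

```python
import numpy as np

def strip_entropy(model, x, clean_samples, alpha=0.5):
    # Average Shannon entropy of the model's predictions when x is
    # superimposed with held-out clean samples. A dominant trigger keeps
    # predictions confident, so backdoored inputs score low entropy.
    entropies = []
    for c in clean_samples:
        blended = alpha * x + (1 - alpha) * c
        p = np.clip(model(blended), 1e-12, 1.0)  # guard log(0)
        entropies.append(float(-np.sum(p * np.log(p))))
    return float(np.mean(entropies))

def toy_model(img):
    # Hypothetical stand-in for a classifier: a bright top-left "trigger"
    # pixel forces class 0; otherwise predictions are uniform over 10 classes.
    if img[0, 0] > 0.45:
        return np.eye(10)[0]
    return np.full(10, 0.1)

rng = np.random.default_rng(0)
clean_samples = []
for _ in range(16):
    img = rng.random((8, 8))
    img[0, 0] = 0.2              # clean data never carries the trigger
    clean_samples.append(img)

x_clean = clean_samples[0].copy()
x_trojan = x_clean.copy()
x_trojan[0, 0] = 1.0             # stamp the trigger

clean_score = strip_entropy(toy_model, x_clean, clean_samples)
trojan_score = strip_entropy(toy_model, x_trojan, clean_samples)
```

In this toy setup the triggered input stays one-hot under every blend, so its average entropy is near zero, while the clean input yields uniform predictions with entropy near ln(10); thresholding the score separates the two.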