jiaxiaojunQAQ / FOA-AttackLinks

Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment (NeurIPS 2025)

☆43

Alternatives and similar repositories for FOA-Attack

Users that are interested in FOA-Attack are comparing it to the libraries listed below

Sorting:

MaTengSYSU / HIMRD-jailbreak
Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"
☆14Updated 4 months ago
jiamingzhang94 / AnyAttack
CVPR 2025 - Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
☆61Updated 4 months ago
HanxunH / XTransferBench
[ICML 2025] X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
☆33Updated 6 months ago
huanranchen / VLMTransfer
A package that achieves 95%+ transfer attack success rate against GPT-4
☆25Updated last year
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆55Updated last year
huanranchen / AdversarialAttacks
☆79Updated last year
KuofengGao / Verbose_Images
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆42Updated last year
Haochen-Luo / CroPA
☆54Updated last year
jiawangbai / BadCLIP
Implementation of BadCLIP https://arxiv.org/pdf/2311.16194.pdf
☆23Updated last year
ZhentingWang / DIAGNOSIS
☆23Updated last year
TeamPigeonLab / CS-DJ
Accept by CVPR 2025 (highlight)
☆21Updated 6 months ago
thinwayliu / Multimodal-Unlearnable-Examples
The code for ACM MM2024 (Multimodal Unlearnable Examples: Protecting Data against Multimodal Contrastive Learning)
☆15Updated last year
thu-ml / Attack-Bard
☆107Updated last year
ericyinyzy / VLAttack
This is an official repository of ``VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models'' (NeurIPS 2…
☆63Updated 9 months ago
RUCAIBox / HADES
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆33Updated last year
gq-max / AdvDiffVLM
☆47Updated 8 months ago
researchcode001 / daca
Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode
☆18Updated 10 months ago
datar001 / Awesome-AD-on-T2IDM
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆87Updated last week
jiaxiaojunQAQ / OmniSafeBench-MM
A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation
☆46Updated last week
WUSTL-CSPL / RIATIG
☆28Updated 2 years ago
zhaisf / BadT2I
[MM'23 Oral] "Text-to-image diffusion models can be easily backdoored through multimodal data poisoning"
☆31Updated 4 months ago
roywang021 / UMK
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆31Updated last year
umd-huang-lab / VLM-Poisoning
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆58Updated 11 months ago
YitingQu / unsafe-diffusion
☆42Updated last year
erfanshayegani / Jailbreak-In-Pieces
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆77Updated last year
serendipity1122 / Pre-trained-Model-Guided-Fine-Tuning-for-Zero-Shot-Adversarial-Robustness
Code repository for CVPR2024 paper 《Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness》
☆25Updated last year
adversarial-for-goodness / Co-Attack
official PyTorch implement of Towards Adversarial Attack on Vision-Language Pre-training Models
☆65Updated 2 years ago
Zoky-2020 / SGA
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]
☆68Updated 2 years ago
LiangSiyuan21 / BadCLIP
☆30Updated last year
OPTML-Group / QF-Attack
[CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu
☆26Updated last year