huanranchen/VLMTransfer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huanranchen/VLMTransfer)

huanranchen / VLMTransfer

A package that achieves 95%+ transfer attack success rate against GPT-4

☆26

Alternatives and similar repositories for VLMTransfer

Users that are interested in VLMTransfer are comparing it to the libraries listed below

Sorting:

RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆37Jun 1, 2025Updated 9 months ago
qizhangli / MoreBayesian-attack
View on GitHub
Code for our ICLR 2023 paper Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples.
☆18May 31, 2023Updated 2 years ago
huanranchen / Visualize-Loss-Landscape
View on GitHub
Respect to the input tensor instead of paramters of NN
☆21Jul 18, 2022Updated 3 years ago
davide-coccomini / Adversarial-Magnification-to-Deceive-Deepfake-Detection-through-Super-Resolution
View on GitHub
Official code for the paper "Adversarial Magnification to Deceive Deepfake Detection through Super Resolution"
☆12Jun 26, 2023Updated 2 years ago
vwesselkamp / deepfake-fingerprint-attacks
View on GitHub
Code accompanying the 2022 DLS paper "Misleading Deep-Fake Detection with GAN Fingerprints"
☆10May 26, 2022Updated 3 years ago
zhaisf / CLiD
View on GitHub
[NeurIPS 2024] "Membership Inference on Text-to-image Diffusion Models via Conditional Likelihood Discrepancy"
☆12Sep 15, 2025Updated 5 months ago
Haochen-Luo / CroPA
View on GitHub
☆55Dec 7, 2024Updated last year
Trustworthy-AI-Group / PGN
View on GitHub
[NeurIPS 2023] Boosting Adversarial Transferability by Achieving Flat Local Maxima
☆34Feb 23, 2024Updated 2 years ago
wbopan / safety-residual-space
View on GitHub
☆21Mar 20, 2025Updated 11 months ago
Muzammal-Naseer / DCViT-AT
View on GitHub
Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)
☆20Aug 24, 2023Updated 2 years ago
inspire-group / tta_risk
View on GitHub
☆14Jun 6, 2023Updated 2 years ago
xavihart / Diff-Protect
View on GitHub
🛡️[ICLR'2024] Toward effective protection against diffusion-based mimicry through score distillation, a.k.a SDS-Attack
☆61Apr 7, 2024Updated last year
ChenWu98 / agent-attack
View on GitHub
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
☆130Feb 19, 2025Updated last year
chuangchuangtan / Data-Independent-Operator
View on GitHub
Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection
☆17Mar 19, 2024Updated last year
tedbackdoordefense / ted
View on GitHub
☆23Dec 14, 2023Updated 2 years ago
Sadcardation / MLLM-Refusal
View on GitHub
Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models
☆18Oct 16, 2024Updated last year
Gwinhen / DRUPE
View on GitHub
Distribution Preserving Backdoor Attack in Self-supervised Learning
☆20Jan 27, 2024Updated 2 years ago
sutd-visual-computing-group / transferable-forensic-features
View on GitHub
[ECCV 2022: Oral] In this work, we discover that color is a crtical transferable forensic feature (T-FF) in universal detectors for detec…
☆16Jan 25, 2023Updated 3 years ago
real-absolute-AI / Unnatural_Language
View on GitHub
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24May 20, 2025Updated 9 months ago
erfanshayegani / Jailbreak-In-Pieces
View on GitHub
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆80Jun 6, 2024Updated last year
tobuta / evadingfakedetector
View on GitHub
We propose a statistical consistency attack (StatAttack) against diverse DeepFake detectors.
☆18Aug 16, 2023Updated 2 years ago
yuyang-long / SSA
View on GitHub
Spectrum simulation attack (ECCV'2022 Oral) towards boosting the transferability of adversarial examples
☆116Jul 21, 2022Updated 3 years ago
weizeming / momentum-attack-llm
View on GitHub
☆23Jan 17, 2025Updated last year
SproutNan / AI-Safety_SCAV
View on GitHub
This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"
☆47Oct 13, 2025Updated 4 months ago
Framartin / lgv-geometric-transferability
View on GitHub
Source of the ECCV22 paper "LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity"
☆18Mar 12, 2025Updated 11 months ago
SaFo-Lab / AGrail4Agent
View on GitHub
[ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".
☆33Aug 4, 2025Updated 7 months ago
jiamingzhang94 / AnyAttack
View on GitHub
CVPR 2025 - Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
☆66Aug 7, 2025Updated 7 months ago
Megum1 / CO-SPY
View on GitHub
[CVPR'25] CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI
☆39Jan 8, 2026Updated 2 months ago
JerryMazeyu / DRA-BlackBoxAttack
View on GitHub
An unofficial implementation of the paper《Towards Understanding and Boosting Adversarial Transferability from a Distribution Perspective》
☆22Nov 24, 2022Updated 3 years ago
jjhuangcs / AdvINN
View on GitHub
Official Code of "Imperceptible Adversarial Attack via Invertible Neural Networks"
☆24Jul 24, 2024Updated last year
nctu-eva-lab / AntifakePrompt
View on GitHub
This is the official implementation of AntifakePrompt.
☆45Aug 15, 2024Updated last year
HorizonTEL / AIGIBench
View on GitHub
☆41Feb 20, 2026Updated 2 weeks ago
Jayfeather1024 / DensePure
View on GitHub
☆20Oct 5, 2023Updated 2 years ago
ylhz / tf_to_pytorch_model
View on GitHub
Convert tensorflow model to pytorch model via [MMdnn](https://github.com/microsoft/MMdnn) for adversarial attacks.
☆94Dec 1, 2022Updated 3 years ago
dreamflake / CFM
View on GitHub
[CVPR 2023] Official implementation of the Clean Feature Mixup (CFM) method
☆23May 25, 2023Updated 2 years ago
DongzeLi-CASIA / Style-atk
View on GitHub
Author implementation of Exploring Adversarial Fake Images on Face Manifold (CVPR 2021 oral)
☆32Mar 2, 2023Updated 3 years ago
yjw1029 / Self-Reminder-Data
View on GitHub
Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder"
☆20Oct 26, 2023Updated 2 years ago
meet-cjli / CTRL
View on GitHub
An Embarrassingly Simple Backdoor Attack on Self-supervised Learning
☆20Jan 24, 2024Updated 2 years ago
Trustworthy-AI-Group / TransferAttack
View on GitHub
TransferAttack is a pytorch framework to boost the adversarial transferability for image classification.
☆461Feb 27, 2026Updated last week