euanong/image-hijacks

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/euanong/image-hijacks)

euanong / image-hijacks

Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime

☆54

Alternatives and similar repositories for image-hijacks

Users that are interested in image-hijacks are comparing it to the libraries listed below

Sorting:

Unispac / Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
View on GitHub
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
☆266May 13, 2024Updated last year
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆59Jun 5, 2024Updated last year
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆35Oct 23, 2024Updated last year
CGCL-codes / Gen-AF
View on GitHub
The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples".
☆11Jun 28, 2024Updated last year
CryptoAILab / FigStep
View on GitHub
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆192Jun 26, 2025Updated 8 months ago
multimodalpragmatic / multimodalpragmatic
View on GitHub
☆13Jan 14, 2026Updated last month
Haochen-Luo / CroPA
View on GitHub
☆55Dec 7, 2024Updated last year
thu-ml / Attack-Bard
View on GitHub
☆109Feb 16, 2024Updated 2 years ago
adversarial-for-goodness / Co-Attack
View on GitHub
official PyTorch implement of Towards Adversarial Attack on Vision-Language Pre-training Models
☆65Mar 20, 2023Updated 2 years ago
ebagdasa / multimodal_injection
View on GitHub
☆98Oct 15, 2023Updated 2 years ago
yunqing-me / AttackVLM
View on GitHub
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
☆228Dec 22, 2024Updated last year
UCSC-VLAA / vllm-safety-benchmark
View on GitHub
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆86Nov 28, 2023Updated 2 years ago
YitingQu / unsafe-diffusion
View on GitHub
☆46Jul 14, 2024Updated last year
isXinLiu / MM-SafetyBench
View on GitHub
Accepted by ECCV 2024
☆192Oct 15, 2024Updated last year
fzwark / Secure_LLM_System
View on GitHub
☆14Mar 9, 2025Updated 11 months ago
facebookresearch / prompt-siren
View on GitHub
A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities a…
☆39Updated this week
jiah-li / magic
View on GitHub
The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.
☆13Dec 16, 2024Updated last year
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆37Jun 1, 2025Updated 9 months ago
sail-sg / DiffMemorize
View on GitHub
[TMLR 2025] On Memorization in Diffusion Models
☆31Oct 5, 2023Updated 2 years ago
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
View on GitHub
☆58Aug 11, 2024Updated last year
TrustAIRLab / VoiceJailbreakAttack
View on GitHub
Code for Voice Jailbreak Attacks Against GPT-4o.
☆36May 31, 2024Updated last year
SchwinnL / LLM_Embedding_Attack
View on GitHub
Code to conduct an embedding attack on LLMs
☆31Jan 10, 2025Updated last year
roywang021 / UMK
View on GitHub
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆31Dec 30, 2024Updated last year
TreeLLi / APT
View on GitHub
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
☆58Dec 20, 2024Updated last year
jpzhang1810 / LDM-Robustness
View on GitHub
Pytorch implementation for the pilot study on the robustness of latent diffusion models.
☆13Jun 20, 2023Updated 2 years ago
jpzhang1810 / TGR
View on GitHub
Official Pytorch implementation for "Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization" (CVPR 20…
☆28Jul 18, 2023Updated 2 years ago
facebookresearch / multimodal-fusion-jailbreaks
View on GitHub
Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)
☆19Oct 22, 2024Updated last year
wbopan / safety-residual-space
View on GitHub
☆21Mar 20, 2025Updated 11 months ago
inspire-group / tta_risk
View on GitHub
☆14Jun 6, 2023Updated 2 years ago
isXinLiu / Awesome-MLLM-Safety
View on GitHub
Accepted by IJCAI-24 Survey Track
☆231Aug 25, 2024Updated last year
TeamPigeonLab / CS-DJ
View on GitHub
Accept by CVPR 2025 (highlight)
☆22Jun 8, 2025Updated 8 months ago
compsec-snu / pfi
View on GitHub
PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents
☆26Mar 26, 2025Updated 11 months ago
google-research / active-adversarial-tests
View on GitHub
Official implementation of the paper "Increasing Confidence in Adversarial Robustness Evaluations"
☆20Feb 20, 2026Updated 2 weeks ago
liudaizong / Awesome-LVLM-Attack
View on GitHub
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆505Feb 17, 2026Updated 2 weeks ago
ericyinyzy / VLAttack
View on GitHub
This is an official repository of ``VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models'' (NeurIPS 2…
☆66Mar 22, 2025Updated 11 months ago
Sadcardation / MLLM-Refusal
View on GitHub
Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models
☆18Oct 16, 2024Updated last year
real-absolute-AI / Unnatural_Language
View on GitHub
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24May 20, 2025Updated 9 months ago
RYC-98 / GRA
View on GitHub
Official codes for GRA (Accepted by ICCV2023)
☆17Jul 18, 2023Updated 2 years ago
NY1024 / Foundation-Model-Paper-Notes
View on GitHub
☆75Jan 21, 2026Updated last month