liuxuannan / Awesome-Multimodal-Jailbreak
View external linksLinks

A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models

☆302

Alternatives and similar repositories for Awesome-Multimodal-Jailbreak

Users that are interested in Awesome-Multimodal-Jailbreak are comparing it to the libraries listed below

Sorting:

NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆57Jun 5, 2024Updated last year
liudaizong / Awesome-LVLM-Attack
View on GitHub
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆490Jan 27, 2026Updated 3 weeks ago
tmllab / 2025_ICLR_PiF
View on GitHub
☆39May 17, 2025Updated 9 months ago
CryptoAILab / Awesome-LM-SSP
View on GitHub
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆1,860Jan 24, 2026Updated 3 weeks ago
MaTengSYSU / HIMRD-jailbreak
View on GitHub
Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"
☆15Aug 7, 2025Updated 6 months ago
TeamPigeonLab / CS-DJ
View on GitHub
Accept by CVPR 2025 (highlight)
☆22Jun 8, 2025Updated 8 months ago
datar001 / Awesome-AD-on-T2IDM
View on GitHub
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆90Dec 20, 2025Updated last month
liuxuannan / MMFakeBench
View on GitHub
[ICLR 2025] MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
☆43Mar 25, 2025Updated 10 months ago
itsvaibhav01 / Immune
View on GitHub
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆27Jun 11, 2025Updated 8 months ago
cuixing100876 / InstaStyle
View on GitHub
☆15Jul 24, 2024Updated last year
isXinLiu / Awesome-MLLM-Safety
View on GitHub
Accepted by IJCAI-24 Survey Track
☆231Aug 25, 2024Updated last year
isXinLiu / MM-SafetyBench
View on GitHub
Accepted by ECCV 2024
☆187Oct 15, 2024Updated last year
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
View on GitHub
☆57Aug 11, 2024Updated last year
liuxuannan / Stochastic-Gradient-Aggregation
View on GitHub
Official implementation of the ICCV2023 paper: Enhancing Generalization of Universal Adversarial Perturbation through Gradient Aggregatio…
☆27Aug 17, 2023Updated 2 years ago
YitingQu / unsafe-diffusion
View on GitHub
☆47Jul 14, 2024Updated last year
Yuchen413 / text2image_safety
View on GitHub
☆197Apr 7, 2025Updated 10 months ago
AI45Lab / VLSBench
View on GitHub
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆54Jul 21, 2025Updated 6 months ago
chen37058 / Red-Team-Arxiv-Paper-Update
View on GitHub
Awesome Jailbreak, red teaming arxiv papers (Automatically Update Every 12th hours)
☆94Updated this week
zhiyichin / P4D
View on GitHub
[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…
☆51Jan 11, 2026Updated last month
roywang021 / UMK
View on GitHub
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆31Dec 30, 2024Updated last year
OSU-NLP-Group / AgentSafety
View on GitHub
☆174Oct 31, 2025Updated 3 months ago
facebookresearch / advprompter
View on GitHub
Official implementation of AdvPrompter https//arxiv.org/abs/2404.16873
☆177May 6, 2024Updated last year
datar001 / Revealing-Vulnerabilities-in-Stable-Diffusion-via-Targeted-Attacks
View on GitHub
☆12Sep 10, 2024Updated last year
erfanshayegani / Jailbreak-In-Pieces
View on GitHub
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆79Jun 6, 2024Updated last year
SaFo-Lab / JailBreakV_28K
View on GitHub
[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…
☆86May 9, 2025Updated 9 months ago
ydyjya / Awesome-LLM-Safety
View on GitHub
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide…
☆1,771Feb 1, 2026Updated 2 weeks ago
Unispac / shallow-vs-deep-alignment
View on GitHub
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
☆174Apr 23, 2025Updated 9 months ago
SheltonLiu-N / AutoDAN
View on GitHub
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…
☆427Jan 22, 2025Updated last year
CryptoAILab / FigStep
View on GitHub
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆191Jun 26, 2025Updated 7 months ago
EasyJailbreak / EasyJailbreak
View on GitHub
An easy-to-use Python framework to generate adversarial jailbreak prompts.
☆815Mar 27, 2025Updated 10 months ago
wonderNefelibata / Awesome-LRM-Safety
View on GitHub
Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …
☆82Updated this week
zju-muslab / AdvReverb
View on GitHub
Code of paper "AdvReverb: AdvReverb: Rethinking the Stealthiness of Audio Adversarial Examples to Human Perception"
☆18Nov 26, 2023Updated 2 years ago
yueliu1999 / Awesome-Jailbreak-on-LLMs
View on GitHub
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, data…
☆1,205Feb 6, 2026Updated last week
cure-lab / MMA-Diffusion
View on GitHub
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
☆383Jan 8, 2026Updated last month
wangyu-ovo / MML
View on GitHub
Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"
☆26Dec 6, 2024Updated last year
Sadcardation / MLLM-Refusal
View on GitHub
Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models
☆18Oct 16, 2024Updated last year
naver-ai / JOOD
View on GitHub
[CVPR 2025] Official implementation for JOOD "Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy"
☆20Jun 11, 2025Updated 8 months ago
AI45Lab / ActorAttack
View on GitHub
☆121Feb 3, 2025Updated last year
NY1024 / Jailbreak_GPT4o
View on GitHub
☆26Jun 5, 2024Updated last year

liuxuannan / Awesome-Multimodal-JailbreakView external linksLinks

Alternatives and similar repositories for Awesome-Multimodal-Jailbreak

liuxuannan / Awesome-Multimodal-Jailbreak
View external linksLinks