Yuchen413/text2image_safety

☆197

Alternatives and similar repositories for text2image_safety

Users who are interested in text2image_safety compare it to the repositories listed below.

researchcode001 / daca
Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Model
☆18 · Feb 16, 2025 · Updated last year
YitingQu / unsafe-diffusion
☆46 · Jul 14, 2024 · Updated last year
ydc123 / MMP-Attack
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆16 · Jul 15, 2024 · Updated last year
ml-research / Q16
☆35 · May 22, 2024 · Updated last year
cure-lab / MMA-Diffusion
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
☆384 · Jan 8, 2026 · Updated last month
Lucas-TY / llm_Implicit_reference
Official Implementation of implicit reference attack
☆11 · Oct 16, 2024 · Updated last year
OPTML-Group / QF-Attack
[CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu
☆26 · Aug 27, 2024 · Updated last year
NYU-DICE-Lab / circumventing-concept-erasure
☆23 · Feb 5, 2026 · Updated last month
OPTML-Group / Diffusion-MU-Attack
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…
☆87 · Feb 28, 2025 · Updated last year
multimodalpragmatic / multimodalpragmatic
☆13 · Jan 14, 2026 · Updated last month
zhiyichin / P4D
[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…
☆52 · Jan 11, 2026 · Updated last month
datar001 / Revealing-Vulnerabilities-in-Stable-Diffusion-via-Targeted-Attacks
☆11 · Sep 10, 2024 · Updated last year
CryptoAILab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆192 · Jun 26, 2025 · Updated 8 months ago
LetterLiGo / SafeGen_CCS2024
[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models
☆138 · Jul 1, 2025 · Updated 8 months ago
SaFo-Lab / Awesome-T2I-safety-Papers
List of T2I safety papers, updated daily, welcome to discuss using Discussions
☆67 · Aug 12, 2024 · Updated last year
datar001 / Awesome-AD-on-T2IDM
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆94 · Dec 20, 2025 · Updated 2 months ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆59 · Jun 5, 2024 · Updated last year
Jinxiaolong1129 / Foot-in-the-door-Jailbreak
☆19 · May 14, 2025 · Updated 9 months ago
chiayi-hsu / Ring-A-Bell
☆38 · Jan 15, 2025 · Updated last year
YancyKahn / CoA
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆39 · Jan 17, 2025 · Updated last year
yunqing-me / AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
☆228 · Dec 22, 2024 · Updated last year
liuxuannan / Awesome-Multimodal-Jailbreak
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
☆308 · Jan 11, 2026 · Updated last month
WUSTL-CSPL / RIATIG
☆28 · May 28, 2023 · Updated 2 years ago
MaTengSYSU / HIMRD-jailbreak
Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"
☆15 · Aug 7, 2025 · Updated 6 months ago
verazuo / prompt-stealing-attack
[USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models
☆51 · Jan 11, 2025 · Updated last year
erfanshayegani / Jailbreak-In-Pieces
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆80 · Jun 6, 2024 · Updated last year
ml-research / safe-latent-diffusion
Official Implementation of Safe Latent Diffusion for Text2Image
☆94 · Apr 21, 2023 · Updated 2 years ago
Unispac / Fight-Poison-With-Poison
Code repository for the paper: [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples
☆30 · Jul 11, 2023 · Updated 2 years ago
AI45Lab / ActorAttack
☆122 · Feb 3, 2025 · Updated last year
isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆192 · Oct 15, 2024 · Updated last year
HanxunH / Detect-CLIP-Backdoor-Samples
[ICLR2025] Detecting Backdoor Samples in Contrastive Language Image Pretraining
☆19 · Feb 26, 2025 · Updated last year
alipay / YiJian-Community
YiJian-Community: a full-process automated large model safety evaluation tool designed for academic research
☆114 · Dec 15, 2025 · Updated 2 months ago
naver-ai / JOOD
[CVPR 2025] Official implementation for JOOD "Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy"
☆21 · Jun 11, 2025 · Updated 8 months ago
Unispac / Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
Repository for the Paper (AAAI 2024, Oral): Visual Adversarial Examples Jailbreak Large Language Models
☆266 · May 13, 2024 · Updated last year
GuanlinLee / ART
Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024)
☆23 · Oct 23, 2024 · Updated last year
weizeming / momentum-attack-llm
☆23 · Jan 17, 2025 · Updated last year
cnut1648 / Model-Fingerprint
Fingerprint large language models
☆49 · Jul 11, 2024 · Updated last year
RU-System-Software-and-Security / NONE
☆10 · Oct 31, 2022 · Updated 3 years ago
thu-coai / JailbreakDefense_GoalPriority
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29 · Jul 9, 2024 · Updated last year