YitingQu/unsafe-diffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YitingQu/unsafe-diffusion)

YitingQu / unsafe-diffusion

☆50

Alternatives and similar repositories for unsafe-diffusion

Users that are interested in unsafe-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

datar001 / Revealing-Vulnerabilities-in-Stable-Diffusion-via-Targeted-Attacks
View on GitHub
☆11Sep 10, 2024Updated last year
OPTML-Group / Diffusion-MU-Attack
View on GitHub
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…
☆89Feb 28, 2025Updated last year
OPTML-Group / QF-Attack
View on GitHub
[CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu
☆27Aug 27, 2024Updated last year
GuanlinLee / ART
View on GitHub
Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024)
☆25Oct 23, 2024Updated last year
Yuchen413 / text2image_safety
View on GitHub
☆202Apr 7, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ml-research / safe-latent-diffusion
View on GitHub
Official Implementation of Safe Latent Diffusion for Text2Image
☆99Apr 21, 2023Updated 3 years ago
ydc123 / MMP-Attack
View on GitHub
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆17Jul 15, 2024Updated 2 years ago
datar001 / Awesome-AD-on-T2IDM
View on GitHub
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆101Dec 20, 2025Updated 7 months ago
chiayi-hsu / Ring-A-Bell
View on GitHub
☆45Jan 15, 2025Updated last year
rrgeorge-pdcontributions / NSFW-Words-List
View on GitHub
Text file containing NSFW words aggregated from various sources.
☆12Aug 23, 2020Updated 5 years ago
WUSTL-CSPL / RIATIG
View on GitHub
☆28May 28, 2023Updated 3 years ago
RYC-98 / GRA
View on GitHub
Official codes for GRA (Accepted by ICCV2023)
☆17Jul 18, 2023Updated 3 years ago
multimodalpragmatic / multimodalpragmatic
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
SaFo-Lab / Awesome-T2I-safety-Papers
View on GitHub
List of T2I safety papers, updated daily, welcome to discuss using Discussions
☆68Aug 12, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
aimagelab / safe-clip
View on GitHub
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
☆68Aug 10, 2024Updated last year
zhiyichin / P4D
View on GitHub
[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…
☆51Jan 11, 2026Updated 6 months ago
rt219 / LatentGuard
View on GitHub
This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"
☆54Oct 24, 2024Updated last year
ml-research / i2p
View on GitHub
☆42Jun 1, 2023Updated 3 years ago
NYU-DICE-Lab / circumventing-concept-erasure
View on GitHub
☆23Feb 5, 2026Updated 5 months ago
researchcode001 / daca
View on GitHub
Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode
☆19Feb 16, 2025Updated last year
Haochen-Luo / CroPA
View on GitHub
☆56Dec 7, 2024Updated last year
tmllab / 2025_ICLR_PiF
View on GitHub
☆40May 17, 2025Updated last year
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆61Jun 5, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CryptoAILab / FigStep
View on GitHub
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆212Jun 26, 2025Updated last year
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆39Oct 23, 2024Updated last year
YitingQu / UnsafeBench
View on GitHub
☆15Mar 5, 2026Updated 4 months ago
isXinLiu / MM-SafetyBench
View on GitHub
Accepted by ECCV 2024
☆218Oct 15, 2024Updated last year
UCSC-VLAA / vllm-safety-benchmark
View on GitHub
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆89Nov 28, 2023Updated 2 years ago
rohitgandikota / erasing
View on GitHub
Erasing Concepts from Diffusion Models
☆664Mar 26, 2026Updated 3 months ago
euanong / image-hijacks
View on GitHub
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
☆56Sep 19, 2023Updated 2 years ago
erfanshayegani / Jailbreak-In-Pieces
View on GitHub
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆93Jun 6, 2024Updated 2 years ago
AI45Lab / VLSBench
View on GitHub
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆62Jul 21, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ys-zong / VLGuard
View on GitHub
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.
☆90Jan 19, 2025Updated last year
Huang-yihao / Personalization-based_backdoor
View on GitHub
☆12Dec 18, 2024Updated last year
XiuchuanLi / fmixup
View on GitHub
TIFS2022: Decision-based Adversarial Attack with Frequency Mixup
☆22Aug 8, 2023Updated 2 years ago
LetterLiGo / SafeGen_CCS2024
View on GitHub
[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models
☆138Mar 30, 2026Updated 3 months ago
sduzpf / UAP_VLP
View on GitHub
Universal Adversarial Perturbations for Vision-Language Pre-trained Models
☆24Aug 8, 2025Updated 11 months ago
KxPlaug / TAA-Bench
View on GitHub
☆13Feb 1, 2024Updated 2 years ago
Qinyu-Allen-Zhao / LVLM-LP
View on GitHub
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
☆43Nov 1, 2024Updated last year