zhiyichin/P4D

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhiyichin/P4D)

zhiyichin / P4D

[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementation)

☆52

Alternatives and similar repositories for P4D

Users that are interested in P4D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OPTML-Group / Diffusion-MU-Attack
View on GitHub
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…
☆89Feb 28, 2025Updated last year
multimodalpragmatic / multimodalpragmatic
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
chiayi-hsu / Ring-A-Bell
View on GitHub
☆46Jan 15, 2025Updated last year
rrgeorge-pdcontributions / NSFW-Words-List
View on GitHub
Text file containing NSFW words aggregated from various sources.
☆12Aug 23, 2020Updated 5 years ago
ml-research / i2p
View on GitHub
☆42Jun 1, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NYU-DICE-Lab / circumventing-concept-erasure
View on GitHub
☆23Feb 5, 2026Updated 5 months ago
OPTML-Group / AdvUnlearn
View on GitHub
Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Model…
☆54Nov 4, 2024Updated last year
YitingQu / unsafe-diffusion
View on GitHub
☆50Jul 14, 2024Updated 2 years ago
ml-research / safe-latent-diffusion
View on GitHub
Official Implementation of Safe Latent Diffusion for Text2Image
☆99Apr 21, 2023Updated 3 years ago
ml-research / Q16
View on GitHub
☆35May 22, 2024Updated 2 years ago
Yuchen413 / text2image_safety
View on GitHub
☆202Apr 7, 2025Updated last year
datar001 / Awesome-AD-on-T2IDM
View on GitHub
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆101Dec 20, 2025Updated 7 months ago
WUSTL-CSPL / RIATIG
View on GitHub
☆28May 28, 2023Updated 3 years ago
CharlesGong12 / RECE
View on GitHub
[ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
☆93Oct 29, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / rl-injector
View on GitHub
Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks
☆53May 6, 2026Updated 2 months ago
SaFo-Lab / Awesome-T2I-safety-Papers
View on GitHub
List of T2I safety papers, updated daily, welcome to discuss using Discussions
☆68Aug 12, 2024Updated last year
RPC2 / AutoInject
View on GitHub
☆20Jun 12, 2026Updated last month
rohitgandikota / unified-concept-editing
View on GitHub
Unified Concept Editing in Diffusion Models
☆194Dec 7, 2025Updated 7 months ago
ydc123 / MMP-Attack
View on GitHub
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆17Jul 15, 2024Updated 2 years ago
rohitgandikota / erasing
View on GitHub
Erasing Concepts from Diffusion Models
☆665Mar 26, 2026Updated 4 months ago
sunblaze-ucb / curriculum-adversarial-training-CAT
View on GitHub
☆14Mar 1, 2019Updated 7 years ago
vtddggg / CAA
View on GitHub
The implementation of our paper: Composite Adversarial Attacks (AAAI2021)
☆29Feb 1, 2022Updated 4 years ago
cure-lab / MMA-Diffusion
View on GitHub
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
☆385Jul 10, 2026Updated 2 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Kim-Minseon / APGP
View on GitHub
Automatic Jailbreaking of the Text-to-Image Generative AI Systems
☆15Jun 23, 2024Updated 2 years ago
ZhangZhuoSJTU / LINT
View on GitHub
☆17Sep 4, 2024Updated last year
yunqing-me / AttackVLM
View on GitHub
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
☆231Dec 22, 2024Updated last year
papersPapers / BadPrompt
View on GitHub
Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"
☆41Jul 8, 2024Updated 2 years ago
ShannonAI / backdoor_nlg
View on GitHub
☆18Jul 1, 2021Updated 5 years ago
Guo-Yunzhe / Awesome_BackdoorAttack_against_NeuralNetwork
View on GitHub
A paper summary of Backdoor Attack against Neural Network
☆13Aug 9, 2019Updated 6 years ago
LetterLiGo / SafeGen_CCS2024
View on GitHub
[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models
☆138Mar 30, 2026Updated 3 months ago
PKU-ML / PAT
View on GitHub
Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"
☆22May 6, 2025Updated last year
daniter-cu / AdversarialSpeech
View on GitHub
Fooling neural based speech recognition systems.
☆14Jun 9, 2017Updated 9 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CGCL-codes / TransferAttackSurrogates
View on GitHub
The official code of IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferabili…
☆20Aug 22, 2024Updated last year
isXinLiu / MM-SafetyBench
View on GitHub
Accepted by ECCV 2024
☆218Oct 15, 2024Updated last year
SobeyMIL / MVOC
View on GitHub
code for "MVOC:atraining-free multiple video object composition method with diffusion models"
☆23Jul 3, 2024Updated 2 years ago
roywang021 / UMK
View on GitHub
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆34Dec 30, 2024Updated last year
RichardSunnyMeng / MSMFN
View on GitHub
IEEE TMI paper: A multi-step modality fusion network for identifying the histologic subtypes of metastatic cervical lymphadenopathy
☆10Nov 23, 2022Updated 3 years ago
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆39Oct 23, 2024Updated last year
alipay / YiJian-Community
View on GitHub
YiJian-Comunity: a full-process automated large model safety evaluation tool designed for academic research
☆113Dec 15, 2025Updated 7 months ago