OPTML-Group / AdvUnlearn
Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models". This work adversarially unlearns the text encoder to enhance the robustness of unlearned DMs against adversarial prompt attacks and achieves a better balance between unlearning performance and image generat…
☆47 Updated 10 months ago
Alternatives and similar repositories for AdvUnlearn
Users who are interested in AdvUnlearn are comparing it to the repositories listed below:
- ☆35 Updated 8 months ago
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns… ☆80 Updated 6 months ago
- A collection of resources on attacks and defenses targeting text-to-image diffusion models ☆73 Updated 5 months ago
- [CVPR'24 Oral] Metacloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning ☆29 Updated 9 months ago
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho… ☆75 Updated 10 months ago
- 🛡️[ICLR'2024] Toward effective protection against diffusion-based mimicry through score distillation, a.k.a SDS-Attack ☆55 Updated last year
- [ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati… ☆46 Updated 9 months ago
- ☆65 Updated 11 months ago
- PDM-based Purifier ☆22 Updated 10 months ago
- Implementation of BadCLIP https://arxiv.org/pdf/2311.16194.pdf ☆21 Updated last year
- ☆21 Updated last year
- [CVPR 2024] official code for SimAC ☆21 Updated 7 months ago
- [ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks ☆36 Updated 4 months ago
- ☆58 Updated 2 years ago
- List of T2I safety papers, updated daily, welcome to discuss using Discussions ☆64 Updated last year
- ☆14 Updated 6 months ago
- [MM '24] EvilEdit: Backdooring Text-to-Image Diffusion Models in One Second ☆24 Updated 9 months ago
- Code of paper [CVPR'24: Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion?] ☆22 Updated last year
- ☆33 Updated last year
- ☆28 Updated last year
- ☆28 Updated 4 months ago
- [ECCV 2024] "Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers" (Official Implementation) ☆41 Updated 6 months ago
- Official repo for An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization ☆15 Updated last year
- Official repo to reproduce the paper "How to Backdoor Diffusion Models?" published at CVPR 2023 ☆92 Updated 4 months ago
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation… ☆130 Updated 3 months ago
- DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing (ICLR 2025) ☆34 Updated 3 months ago
- Official implementation of the paper: Stable Diffusion is Unstable ☆23 Updated last year
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu ☆26 Updated last year
- [ICCV-2025] Universal Adversarial Attack, Multimodal Adversarial Attacks, VLP models, Contrastive Learning, Cross-modal Perturbation Gene… ☆24 Updated 2 months ago
- ☆21 Updated last year