OPTML-Group / AdvUnlearnLinks
Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models". This work adversarially unlearns the text encoder to enhance the robustness of unlearned DMs against adversarial prompt attacks and achieves a better balance between unlearning performance and image generat…
☆49Updated last year
Alternatives and similar repositories for AdvUnlearn
Users that are interested in AdvUnlearn are comparing it to the libraries listed below
Sorting:
- ☆37Updated last year
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…☆86Updated 11 months ago
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho…☆83Updated last year
- ☆23Updated 2 years ago
- [CVPR'24 Oral] Metacloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning☆29Updated last year
- [ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…☆51Updated 3 weeks ago
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation…☆140Updated 8 months ago
- [MM '24] EvilEdit: Backdooring Text-to-Image Diffusion Models in One Second☆27Updated last year
- 🛡️[ICLR'2024] Toward effective protection against diffusion-based mimicry through score distillation, a.k.a SDS-Attack☆59Updated last year
- ☆65Updated last year
- A collection of resources on attacks and defenses targeting text-to-image diffusion models☆90Updated last month
- Implementation of BadCLIP https://arxiv.org/pdf/2311.16194.pdf☆23Updated last year
- [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk☆16Updated 10 months ago
- ☆59Updated 3 years ago
- Official repo for An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization☆16Updated last year
- ☆34Updated last year
- [ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks☆38Updated 9 months ago
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor…☆22Updated last year
- List of T2I safety papers, updated daily, welcome to discuss using Discussions☆67Updated last year
- [CVPR 2024] official code for SimAC☆21Updated last year
- ☆28Updated last year
- PDM-based Purifier☆22Updated last year
- [CVPR'25]Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks☆29Updated 7 months ago
- ☆46Updated last year
- ☆33Updated 9 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆77Updated last year
- ☆13Updated last year
- Code for the paper "Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks"☆39Updated last year
- Official repo to reproduce the paper "How to Backdoor Diffusion Models?" published at CVPR 2023☆95Updated 4 months ago
- [ECCV2024] Immunizing text-to-image Models against Malicious Adaptation☆17Updated last year