Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models". This work adversarially unlearns the text encoder to enhance the robustness of unlearned DMs against adversarial prompt attacks and achieves a better balance between unlearning performance and image generat…
☆49 · Nov 4, 2024 · Updated last year
Alternatives and similar repositories for AdvUnlearn
Users who are interested in AdvUnlearn are comparing it to the repositories listed below.
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho… ☆82 · Nov 11, 2024 · Updated last year
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns… ☆87 · Feb 28, 2025 · Updated last year
- [ECCV 2024] "Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers" (Official Implementation) ☆44 · Mar 2, 2025 · Updated last year
- [ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models ☆87 · Oct 29, 2024 · Updated last year
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning ☆22 · Aug 13, 2024 · Updated last year
- ☆16 · Feb 23, 2025 · Updated last year
- NeurIPS 2024 - Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation ☆17 · Dec 5, 2024 · Updated last year
- Unified Concept Editing in Diffusion Models ☆184 · Dec 7, 2025 · Updated 3 months ago
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models" ☆19 · Dec 16, 2024 · Updated last year
- A collection of resources on attacks and defenses targeting text-to-image diffusion models ☆96 · Dec 20, 2025 · Updated 3 months ago
- ☆39 · Jan 15, 2025 · Updated last year
- A repository of resources on machine unlearning for diffusion models ☆60 · Oct 9, 2025 · Updated 5 months ago
- [NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" ☆43 · Oct 3, 2025 · Updated 5 months ago
- ☆34 · Aug 26, 2025 · Updated 6 months ago
- ☆35 · May 22, 2024 · Updated last year
- ☆33 · Apr 22, 2025 · Updated 11 months ago
- EraseDiff: Erasing Data Influence in Diffusion Models ☆14 · Nov 20, 2024 · Updated last year
- [CVPR 2025] Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models ☆16 · Jan 8, 2026 · Updated 2 months ago
- [NeurIPS23 (Spotlight)] "Model Sparsity Can Simplify Machine Unlearning" by Jinghan Jia*, Jiancheng Liu*, Parikshit Ram, Yuguang Yao, Gao… ☆84 · Feb 28, 2026 · Updated 3 weeks ago
- A toolkit for optimizing machine learning models for practical applications ☆31 · Mar 6, 2025 · Updated last year
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase ☆11 · Jun 2, 2024 · Updated last year
- ☆42 · Jun 1, 2023 · Updated 2 years ago
- EraseAnything, ICML 2025 ☆39 · Sep 28, 2025 · Updated 5 months ago
- Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024) ☆23 · Oct 23, 2024 · Updated last year
- Erasing Concepts from Diffusion Models ☆657 · Aug 18, 2025 · Updated 7 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation ☆55 · Jan 22, 2025 · Updated last year
- [CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation) ☆394 · Jun 2, 2025 · Updated 9 months ago
- ☆23 · Sep 28, 2023 · Updated 2 years ago
- ☆10 · Mar 23, 2025 · Updated 11 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications". ☆152 · Dec 28, 2023 · Updated 2 years ago
- Official repo for An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization ☆16 · Mar 8, 2024 · Updated 2 years ago
- Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024 ☆67 · Aug 10, 2024 · Updated last year
- ☆23 · May 9, 2024 · Updated last year
- Official PyTorch Implementation ☆17 · Dec 3, 2022 · Updated 3 years ago
- ☆26 · Oct 6, 2024 · Updated last year
- Official repository for Targeted Unlearning with Single Layer Unlearning Gradient (SLUG), ICML 2025 ☆15 · Aug 10, 2025 · Updated 7 months ago
- [CVPR 2024] official code for SimAC ☆21 · Jan 23, 2025 · Updated last year
- Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023) ☆168 · Dec 21, 2024 · Updated last year
- [ECCV 2024] "Prediction Exposes Your Face: Black-box Model Inversion via Prediction Alignment" ☆15 · Mar 12, 2025 · Updated last year