NYU-DICE-Lab / circumventing-concept-erasureView external linksLinks
☆23Feb 5, 2026Updated last week
Alternatives and similar repositories for circumventing-concept-erasure
Users that are interested in circumventing-concept-erasure are comparing it to the libraries listed below
Sorting:
- ☆38Jan 15, 2025Updated last year
- Unified Concept Editing in Diffusion Models☆183Dec 7, 2025Updated 2 months ago
- Official repository for Targeted Unlearning with Single Layer Unlearning Gradient (SLUG), ICML 2025☆15Aug 10, 2025Updated 6 months ago
- [ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…☆51Jan 11, 2026Updated last month
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- [CVPR 2025] Official PyTorch Implementation for GLoCE: Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free G…☆19Jul 10, 2025Updated 7 months ago
- Separable Diffusion Model Unlearning☆13Jan 29, 2025Updated last year
- ☆197Apr 7, 2025Updated 10 months ago
- This is the repository for USENIX Security 2023 paper "Hard-label Black-box Universal Adversarial Patch Attack".☆15Sep 5, 2023Updated 2 years ago
- ☆16Apr 21, 2022Updated 3 years ago
- ☆47Jul 14, 2024Updated last year
- ☆48Feb 8, 2025Updated last year
- [CVPR 2024] Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation☆47May 14, 2024Updated last year
- ☆22Apr 15, 2022Updated 3 years ago
- Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode☆18Feb 16, 2025Updated last year
- Official Implementation of Safe Latent Diffusion for Text2Image☆94Apr 21, 2023Updated 2 years ago
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation…☆141May 27, 2025Updated 8 months ago
- Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023☆136Oct 22, 2025Updated 3 months ago
- [⭐ CVPR 2025 Highlight ⭐] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing fro…☆29Apr 22, 2025Updated 9 months ago
- Code for NDSS paper: Stealthy Adversarial Perturbations Against Real-Time Video Classification Systems☆21Nov 24, 2018Updated 7 years ago
- Erasing Concepts from Diffusion Models☆655Aug 18, 2025Updated 5 months ago
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor…☆22Mar 19, 2024Updated last year
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Dec 20, 2024Updated last year
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu☆26Aug 27, 2024Updated last year
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 10 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 8 months ago
- ☆40Jun 1, 2023Updated 2 years ago
- Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]☆31Jul 15, 2020Updated 5 years ago
- code of paper "IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Gene…☆34May 23, 2024Updated last year
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆16Sep 16, 2025Updated 5 months ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- ☆10Jun 23, 2018Updated 7 years ago
- ☆11May 24, 2024Updated last year
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- Official Implementation for CVPR 2025 paper Instant Adversarial Purification with Adversarial Consistency Distillation.☆14Dec 19, 2025Updated last month
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11May 23, 2024Updated last year
- Geometric Certifications of Neural Nets☆42Nov 22, 2022Updated 3 years ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025).☆60Sep 25, 2025Updated 4 months ago
- This work corroborates a run-time Trojan detection method exploiting STRong Intentional Perturbation of inputs, is a multi-domain Trojan …☆10Mar 7, 2021Updated 4 years ago