Visualignment / SafetyDPO
★33 Updated 5 months ago
Alternatives and similar repositories for SafetyDPO
Users who are interested in SafetyDPO are comparing it to the libraries listed below.
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation" ★52 Updated last year
- 🛡️[ICLR'2024] Toward effective protection against diffusion-based mimicry through score distillation, a.k.a SDS-Attack ★59 Updated last year
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation ★53 Updated last year
- List of T2I safety papers, updated daily, welcome to discuss using Discussions ★67 Updated last year
- [CVPR'24 Oral] Metacloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning ★28 Updated last year
- [ECCV'24] T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models ★17 Updated last month
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns… ★86 Updated 11 months ago
- [CVPR 2024] Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation ★47 Updated last year
- Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Model… ★49 Updated last year
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin". ★35 Updated 6 months ago
- Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness ★45 Updated last year
- [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient ★44 Updated 9 months ago
- Official implementation of the paper: Stable Diffusion is Unstable ★23 Updated last year
- [CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters ★43 Updated 10 months ago
- ★35 Updated last year
- A collection of resources on attacks and defenses targeting text-to-image diffusion models ★90 Updated last month
- [CVPR 2024] official code for SimAC ★21 Updated last year
- ★38 Updated last year
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025 ★28 Updated 9 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance" ★44 Updated last year
- ★40 Updated 2 years ago
- ★33 Updated 9 months ago
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor… ★22 Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? ★40 Updated last year
- ★65 Updated last year
- Official implementation of "Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models" ★25 Updated 8 months ago
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models" ★15 Updated 10 months ago
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho… ★83 Updated last year
- [ECCV 2024] "Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers" (Official Implementation) ★44 Updated 11 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models ★110 Updated last year