wellzline / Trustworthy_T2I_DMsLinks

☆12

Alternatives and similar repositories for Trustworthy_T2I_DMs

Users that are interested in Trustworthy_T2I_DMs are comparing it to the libraries listed below

Sorting:

sail-sg / Meta-Unlearning
☆30Updated 6 months ago
OPTML-Group / Diffusion-MU-Attack
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…
☆86Updated 8 months ago
haonan3 / ICML-2024-Oral-SilentBadDiffusion
☆12Updated 11 months ago
OPTML-Group / AdvUnlearn
Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Model…
☆49Updated 11 months ago
OPTML-Group / UnlearnCanvas
[NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho…
☆77Updated 11 months ago
nannullna / safe-diffusion
The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor…
☆21Updated last year
UCSC-VLAA / vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆83Updated last year
chiayi-hsu / Ring-A-Bell
☆35Updated 9 months ago
umd-huang-lab / VLM-Poisoning
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆55Updated 9 months ago
THU-BPM / Watermark-Radioactivity-Attack
Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?". (ACL 2025 Main)
☆17Updated 4 months ago
Xiangkui-Cao / VLBiasBench
A large-scale dataset composed of high-quality synthetic images aimed at evaluating social biases in LVLMs
☆13Updated 3 weeks ago
clear-nus / selective-amnesia
☆64Updated last year
YitingQu / unsafe-diffusion
☆38Updated last year
franciscoliu / Awesome-GenAI-Unlearning
☆171Updated 3 months ago
renjie3 / MemAttn
☆13Updated 8 months ago
chenchenygu / watermark-learnability
☆26Updated 8 months ago
joycenerd / P4D
[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…
☆48Updated 11 months ago
xiaojunxu / learning-to-watermark-llm
☆21Updated last year
zihao-ai / unthinking_vulnerability
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
☆32Updated 5 months ago
chs20 / RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
☆147Updated 4 months ago
AI45Lab / REEF
The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…
☆66Updated 9 months ago
sail-sg / DiffMemorize
[TMLR 2025] On Memorization in Diffusion Models
☆27Updated 2 years ago
20000yshust / SWARM
[CVPR 2024] Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers
☆16Updated last year
UCSC-VLAA / STAR-1
☆30Updated 6 months ago
yaojin17 / Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆60Updated last year
datar001 / Awesome-AD-on-T2IDM
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆77Updated 7 months ago
VITA-Group / Shake-to-Leak
☆15Updated 7 months ago
OPTML-Group / Unlearn-Saliency
[ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation…
☆137Updated 5 months ago
ml-research / Q16
☆33Updated last year
LiangSiyuan21 / BadCLIP
☆25Updated last year