ai-data-model-safety / ai-data-model-safety.github.ioLinks

☆49

Alternatives and similar repositories for ai-data-model-safety.github.io

Users that are interested in ai-data-model-safety.github.io are comparing it to the libraries listed below

Sorting:

shaoshuo-ss / EaaW
[NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…
☆45Updated last year
xingjunm / Awesome-Large-Model-Safety
Safety at Scale: A Comprehensive Survey of Large Model Safety
☆225Updated last week
NY1024 / Foundation-Model-Paper-Notes
☆73Updated 3 weeks ago
Trustworthy-AI-Group / Adversarial_Examples_Papers
A list of recent papers about adversarial learning
☆304Updated last week
whdii / TMM
☆20Updated 2 years ago
AntigoneRandy / SIREN
Official Implementation for "Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models" (IE…
☆27Updated 10 months ago
jiamingzhang94 / AnyAttack
CVPR 2025 - Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
☆66Updated 6 months ago
ericyinyzy / VLAttack
This is an official repository of ``VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models'' (NeurIPS 2…
☆66Updated 10 months ago
liuxuannan / Awesome-Multimodal-Jailbreak
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
☆302Updated last month
zihao-ai / Awesome-Backdoor-in-Deep-Learning
A curated list of papers & resources on backdoor attacks and defenses in deep learning.
☆235Updated last year
penghui-yang / awesome-data-poisoning-and-backdoor-attacks
A curated list of papers & resources linked to data poisoning, backdoor attacks and defenses against them (no longer maintained)
☆286Updated last year
adversarial-for-goodness / Co-Attack
official PyTorch implement of Towards Adversarial Attack on Vision-Language Pre-training Models
☆65Updated 2 years ago
T0hsakar1n / RAPID
Source code and scripts for the paper "Is Difficulty Calibration All We Need? Towards More Practical Membership Inference Attacks"
☆20Updated last year
huanranchen / AdversarialAttacks
☆80Updated last year
ZJUICSR / AIcert
☆224Updated 5 months ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆57Updated last year
liudaizong / Awesome-LVLM-Attack
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆485Updated 2 weeks ago
Zhou-Zi7 / Awesome-AI-Security-BIG4
This Github repository summarizes a list of research papers on AI security from the four top academic conferences.
☆176Updated 8 months ago
yuezunli / ISSBA
Invisible Backdoor Attack with Sample-Specific Triggers
☆105Updated 3 years ago
gq-max / AdvDiffVLM
☆48Updated 10 months ago
WUSTL-CSPL / RIATIG
☆28Updated 2 years ago
xuxiong0214 / BTIDBF
☆17Updated last year
LiangSiyuan21 / BadCLIP
☆30Updated last year
thu-ml / Attack-Bard
☆109Updated last year
roywang021 / UMK
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆31Updated last year
KuofengGao / Verbose_Images
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆42Updated 2 years ago
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
☆58Updated last year
Trustworthy-AI-Group / TransferAttack
TransferAttack is a pytorch framework to boost the adversarial transferability for image classification.
☆437Updated 3 weeks ago
mengtong0110 / InferDPT
☆34Updated 2 months ago
WUSTL-CSPL / LLMJailbreak
☆37Updated last year