Trustworthy-AI-Group / Adversarial_Examples_PapersLinks

A list of recent papers about adversarial learning

☆192

Alternatives and similar repositories for Adversarial_Examples_Papers

Users that are interested in Adversarial_Examples_Papers are comparing it to the libraries listed below

Sorting:

zihao-ai / Awesome-Backdoor-in-Deep-Learning
A curated list of papers & resources on backdoor attacks and defenses in deep learning.
☆216Updated last year
penghui-yang / awesome-data-poisoning-and-backdoor-attacks
A curated list of papers & resources linked to data poisoning, backdoor attacks and defenses against them (no longer maintained)
☆266Updated 6 months ago
thu-ml / Attack-Bard
☆102Updated last year
liuxuannan / Awesome-Multimodal-Jailbreak
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
☆215Updated last month
JindongGu / Awesome_Adversarial_Transferability
A curated list of papers for the transferability of adversarial examples
☆72Updated last year
ericyinyzy / VLAttack
This is an official repository of ``VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models'' (NeurIPS 2…
☆56Updated 4 months ago
xingjunm / Awesome-Large-Model-Safety
Safety at Scale: A Comprehensive Survey of Large Model Safety
☆183Updated 5 months ago
roywang021 / UMK
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆29Updated 7 months ago
qingjiesjtu / USC
This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.
☆58Updated 7 months ago
ZhengyuZhao / TransferAttackEval
Revisiting Transferable Adversarial Images (arXiv)
☆124Updated 4 months ago
huanranchen / AdversarialAttacks
☆76Updated last year
liudaizong / Awesome-LVLM-Attack
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆349Updated last week
Trustworthy-AI-Group / TransferAttack
TransferAttack is a pytorch framework to boost the adversarial transferability for image classification.
☆378Updated 2 weeks ago
bboylyg / BackdoorLLM
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
☆188Updated last month
grasses / PromptCARE
Code for paper: "PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification", IEEE S&P 2024.
☆32Updated 11 months ago
VinAIResearch / Warping-based_Backdoor_Attack-release
WaNet - Imperceptible Warping-based Backdoor Attack (ICLR 2021)
☆127Updated 8 months ago
NY1024 / Foundation-Model-Paper-Notes
☆58Updated 2 months ago
yuezunli / ISSBA
Invisible Backdoor Attack with Sample-Specific Triggers
☆97Updated 3 years ago
ZJZAC / awesome-deep-model-IP-protection
☆40Updated 3 years ago
shenyizg / NewAdversarialAttackPaper
A list of recent adversarial attack and defense papers (including those on large language models)
☆42Updated this week
GuanlinLee / ART
Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024)
☆16Updated 9 months ago
jinyuan-jia / BadEncoder
☆82Updated 4 years ago
Gwinhen / BackdoorVault
A toolbox for backdoor attacks.
☆22Updated 2 years ago
yunqing-me / AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
☆209Updated 7 months ago
jiamingzhang94 / AnyAttack
CVPR 2025 - Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
☆41Updated last month
KuofengGao / Verbose_Images
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆37Updated last year
ThuCCSLab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆160Updated last month
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆48Updated last year
SCLBD / BlackboxBench
☆112Updated 2 months ago
shaoshuo-ss / EaaW
[NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…
☆39Updated 9 months ago