thu-ml / Attack-Bard

☆105

Alternatives and similar repositories for Attack-Bard

Users who are interested in Attack-Bard are comparing it to the repositories listed below.

KuofengGao / Verbose_Images
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆40 Updated last year
yunqing-me / AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
☆216 Updated 10 months ago
erfanshayegani / Jailbreak-In-Pieces
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆73 Updated last year
ericyinyzy / VLAttack
This is an official repository of ``VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models'' (NeurIPS 2…
☆58 Updated 7 months ago
thu-ml / MMTrustEval
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
☆168 Updated 4 months ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆53 Updated last year
umd-huang-lab / VLM-Poisoning
Code for NeurIPS 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆56 Updated 9 months ago
euanong / image-hijacks
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
☆51 Updated 2 years ago
Haochen-Luo / CroPA
☆52 Updated 10 months ago
RUCAIBox / HADES
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆30 Updated last year
Unispac / Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
☆243 Updated last year
thunxxx / MLLM-Jailbreak-evaluation-MMJ-Bench
☆63 Updated 7 months ago
huanranchen / VLMTransfer
A package that achieves 95%+ transfer attack success rate against GPT-4
☆23 Updated last year
CryptoAILab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆177 Updated 4 months ago
sail-sg / AnyDoor
AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models
☆59 Updated last year
rain152 / PAT
[NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning
☆10 Updated last year
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
☆48 Updated last year
jiaxiaojunQAQ / FOA-Attack
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment (NeurIPS 2025)
☆36 Updated 2 weeks ago
roywang021 / UMK
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆30 Updated 10 months ago
XuanChen-xc / RLbreaker
Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024)
☆13 Updated last year
isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆169 Updated last year
huanranchen / AdversarialAttacks
☆79 Updated last year
jiamingzhang94 / AnyAttack
CVPR 2025 - Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
☆55 Updated 2 months ago
TeamPigeonLab / CS-DJ
Accepted by CVPR 2025 (Highlight)
☆19 Updated 4 months ago
liuxuannan / Awesome-Multimodal-Jailbreak
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
☆244 Updated last month
PKU-ML / PAT
Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"
☆19 Updated 5 months ago
Alibaba-AAIG / Oyster
The Oyster series is a set of safety models developed in-house by Alibaba-AAIG, devoted to building a responsible AI ecosystem. | Oyster …
☆52 Updated last month
jiawangbai / BadCLIP
Implementation of BadCLIP https://arxiv.org/pdf/2311.16194.pdf
☆21 Updated last year
qingjiesjtu / USC
This is the code repository of our submission: Understanding the Dark Side of LLMs' Intrinsic Self-Correction.
☆63 Updated 10 months ago
SaFoLab-WISC / JailBreakV_28K
[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…
☆80 Updated 5 months ago