Dtc7w3PQ / Visco-Attack

Official implementation of Visco-Attack (EMNLP 2025 Main). We will progressively release the code and one-click reproduction scripts.

☆26

Alternatives and similar repositories for Visco-Attack

Users who are interested in Visco-Attack are comparing it to the repositories listed below.

isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆179 Updated last year
NY1024 / Foundation-Model-Paper-Notes
☆71 Updated 7 months ago
CryptoAILab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆182 Updated 6 months ago
roywang021 / IDEATOR
Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves"
☆14 Updated 5 months ago
ASTRAL-Group / ASTRA
[CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbre…
☆47 Updated 5 months ago
RUCAIBox / HADES
[ECCV'24 Oral] The official GitHub page for "Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆33 Updated last year
Haochen-Luo / CroPA
☆54 Updated last year
wangyu-ovo / MML
Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"
☆25 Updated last year
thunxxx / MLLM-Jailbreak-evaluation-MMJ-Bench
☆65 Updated 8 months ago
erfanshayegani / Jailbreak-In-Pieces
[ICLR 2024 Spotlight 🔥] [Best Paper Award, SoCal NLP 2023 🏆] Jailbreak in Pieces: Compositional Adversarial Attacks on Multi-Modal…
☆77 Updated last year
YiyiyiZhao / siren
Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …
☆13 Updated 3 months ago
PKU-ML / PAT
Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"
☆22 Updated 7 months ago
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
☆54 Updated last year
tmllab / 2025_ICLR_PiF
☆37 Updated 7 months ago
huanranchen / VLMTransfer
A package that achieves 95%+ transfer attack success rate against GPT-4
☆25 Updated last year
roywang021 / UMK
Code for the ACM MM 2024 paper "White-box Multimodal Jailbreaks Against Large Vision-Language Models"
☆31 Updated 11 months ago
isXinLiu / Awesome-MLLM-Safety
Accepted by IJCAI-24 Survey Track
☆225 Updated last year
SaFo-Lab / JailBreakV_28K
[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…
☆84 Updated 7 months ago
liuxuannan / Awesome-Multimodal-Jailbreak
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
☆292 Updated last month
zihao-ai / unthinking_vulnerability
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
☆32 Updated 7 months ago
DSN-2024 / DSN
DSN jailbreak Attack & Evaluation Ensemble
☆14 Updated last month
AI45Lab / VLSBench
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆52 Updated 5 months ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆54 Updated last year
ybwang119 / label_recovery
[ICLR 2024] Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks
☆14 Updated last year
ledllm / ledllm
☆25 Updated last year
AI-secure / MMDT
Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models
☆25 Updated 9 months ago
TeamPigeonLab / CS-DJ
Accepted by CVPR 2025 (Highlight)
☆21 Updated 6 months ago
wonderNefelibata / Awesome-LRM-Safety
Awesome Large Reasoning Model (LRM) Safety. This repository collects security-related research on large reasoning models such as …
☆78 Updated this week
SproutNan / AI-Safety_SCAV
This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"
☆47 Updated 2 months ago
Vinsonzyh / BlueSuffix
[ICLR 2025] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
☆30 Updated last month