MaTengSYSU/HIMRD-jailbreak

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MaTengSYSU/HIMRD-jailbreak)

MaTengSYSU / HIMRD-jailbreak

Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"

☆15

Alternatives and similar repositories for HIMRD-jailbreak

Users that are interested in HIMRD-jailbreak are comparing it to the libraries listed below

Sorting:

TeamPigeonLab / CS-DJ
View on GitHub
Accept by CVPR 2025 (highlight)
☆22Jun 8, 2025Updated 9 months ago
Nathangitlab / Backdoor-Attacks-on-Crowd-Counting
View on GitHub
this is for the ACM MM paper---Backdoor Attack on Crowd Counting
☆17Jul 10, 2022Updated 3 years ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆59Jun 5, 2024Updated last year
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆35Oct 23, 2024Updated last year
jiaxiaojunQAQ / FOA-Attack
View on GitHub
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment (NeurIPS 2025)
☆50Nov 5, 2025Updated 4 months ago
itsvaibhav01 / Immune
View on GitHub
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆27Jun 11, 2025Updated 8 months ago
xpf / Data-Efficient-Backdoor-Attacks
View on GitHub
Data-Efficient Backdoor Attacks
☆20Jun 15, 2022Updated 3 years ago
konpanousis / Adversarial-LWTA-AutoAttack
View on GitHub
☆12May 6, 2022Updated 3 years ago
Ekko-zn / IJCAI2022-Backdoor
View on GitHub
☆20May 6, 2022Updated 3 years ago
reds-lab / ASSET
View on GitHub
This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…
☆19Jun 7, 2023Updated 2 years ago
SewoongLab / spectre-defense
View on GitHub
Defending Against Backdoor Attacks Using Robust Covariance Estimation
☆22Jul 12, 2021Updated 4 years ago
yuplin2333 / representation-space-jailbreak
View on GitHub
Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…
☆23Jul 26, 2024Updated last year
wanlunsec / Beatrix
View on GitHub
☆27Feb 1, 2023Updated 3 years ago
jiaxiaojunQAQ / OmniSafeBench-MM
View on GitHub
A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation
☆61Mar 2, 2026Updated last week
gufranSabri / FSBI
View on GitHub
☆12Jan 25, 2025Updated last year
Huang-yihao / Personalization-based_backdoor
View on GitHub
☆10Dec 18, 2024Updated last year
yjkim721 / STRIP-ViTA
View on GitHub
This work corroborates a run-time Trojan detection method exploiting STRong Intentional Perturbation of inputs, is a multi-domain Trojan …
☆10Mar 7, 2021Updated 5 years ago
clearloveclearlove / BEAT
View on GitHub
☆14Feb 26, 2025Updated last year
jiah-li / magic
View on GitHub
The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.
☆13Dec 16, 2024Updated last year
dunky11 / piracy-resistant-watermarks
View on GitHub
Implemention of "Piracy Resistant Watermarks for Deep Neural Networks" in TensorFlow.
☆12Dec 5, 2020Updated 5 years ago
Lucas-TY / llm_Implicit_reference
View on GitHub
Official Implementation of implicit reference attack
☆11Oct 16, 2024Updated last year
jiaxiaojunQAQ / FP-Better
View on GitHub
Code for Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks (TIFS2024)
☆13Mar 29, 2024Updated last year
thefcraft / prompt-generator-stable-diffusion
View on GitHub
Prompt Generator model for Stable Diffusion Models
☆11Jun 20, 2023Updated 2 years ago
RU-System-Software-and-Security / NONE
View on GitHub
☆10Oct 31, 2022Updated 3 years ago
jiaxiaojunQAQ / FGSM-PGK
View on GitHub
Improving fast adversarial training with prior-guided knowledge (TPAMI2024)
☆43Apr 21, 2024Updated last year
NY1024 / Jailbreak_GPT4o
View on GitHub
☆26Jun 5, 2024Updated last year
HuXiaoling / TopoTrigger
View on GitHub
Codes for the ICLR 2022 paper: Trigger Hunting with a Topological Prior for Trojan Detection
☆11Sep 19, 2023Updated 2 years ago
PurduePAML / Exray
View on GitHub
☆12May 27, 2022Updated 3 years ago
HXZhong1997 / FSBA
View on GitHub
☆11Jan 25, 2022Updated 4 years ago
yanyanSann / Long-Tailed-Classification-Leaderboard
View on GitHub
☆29Mar 3, 2021Updated 5 years ago
roywang021 / UMK
View on GitHub
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆31Dec 30, 2024Updated last year
Sandy-Zeng / NPAttack
View on GitHub
Pytorch implementation of NPAttack
☆12Jul 7, 2020Updated 5 years ago
liuxuannan / Stochastic-Gradient-Aggregation
View on GitHub
Official implementation of the ICCV2023 paper: Enhancing Generalization of Universal Adversarial Perturbation through Gradient Aggregatio…
☆27Aug 17, 2023Updated 2 years ago
THU-KEG / SafetyNeuron
View on GitHub
Data and code for the paper: Finding Safety Neurons in Large Language Models
☆22Jan 29, 2026Updated last month
RU-System-Software-and-Security / UNICORN
View on GitHub
☆15Apr 7, 2023Updated 2 years ago
uwFengyuan / OCC-CLIP
View on GitHub
☆14Jan 4, 2025Updated last year
VITA-Group / Trap-and-Replace-Backdoor-Defense
View on GitHub
[NeurIPS'22] Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork. Haotao Wang, Junyuan Hong,…
☆15Nov 27, 2023Updated 2 years ago
Unispac / Fight-Poison-With-Poison
View on GitHub
Code repository for the paper --- [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples
☆30Jul 11, 2023Updated 2 years ago
rkteddy / channel-Lipschitzness-based-pruning
View on GitHub
Source code for ECCV 2022 Poster: Data-free Backdoor Removal based on Channel Lipschitzness
☆35Jan 9, 2023Updated 3 years ago