ebagdasa/multimodal_injection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ebagdasa/multimodal_injection)

ebagdasa / multimodal_injection

☆101

Alternatives and similar repositories for multimodal_injection

Users that are interested in multimodal_injection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ebagdasa / adversarial_illusions
View on GitHub
Code for "Adversarial Illusions in Multi-Modal Embeddings"
☆32Aug 4, 2024Updated last year
AoiDragon / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆41Oct 17, 2024Updated last year
euanong / image-hijacks
View on GitHub
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
☆56Sep 19, 2023Updated 2 years ago
ChengshuaiZhao0 / The-Wolf-Within
View on GitHub
☆13Updated this week
Unispac / Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
View on GitHub
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
☆281May 13, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CryptoAILab / FigStep
View on GitHub
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆211Jun 26, 2025Updated last year
Cinofix / sponge_poisoning_energy_latency_attack
View on GitHub
Source code for the Energy-Latency Attacks via Sponge Poisoning paper.
☆15Mar 14, 2022Updated 4 years ago
erfanshayegani / Jailbreak-In-Pieces
View on GitHub
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆93Jun 6, 2024Updated 2 years ago
UCSC-VLAA / vllm-safety-benchmark
View on GitHub
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆89Nov 28, 2023Updated 2 years ago
Algorithmic-Alignment-Lab / CommonClaim
View on GitHub
Explore, Establish, Exploit: Red Teaming Language Models from Scratch
☆15Jun 21, 2023Updated 3 years ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆61Jun 5, 2024Updated 2 years ago
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
View on GitHub
☆63Aug 11, 2024Updated last year
xijia-tao / ImgTrojan
View on GitHub
Code and data for "ImgTrojan: Jailbreaking Vision-Language Models with ONE Image"
☆24Mar 26, 2025Updated last year
papersPapers / BadPrompt
View on GitHub
Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"
☆41Jul 8, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆37Jun 1, 2025Updated last year
Haochen-Luo / CroPA
View on GitHub
☆56Dec 7, 2024Updated last year
cleverhans-lab / dataset-inference
View on GitHub
[ICLR'21] Dataset Inference for Ownership Resolution in Machine Learning
☆31Oct 10, 2022Updated 3 years ago
thu-ml / Attack-Bard
View on GitHub
☆108Feb 16, 2024Updated 2 years ago
g0h4n / dende-rs
View on GitHub
Monitoring tool to detect patterns or IOCs (strings, regex, VirusTotal) and alert you and your team via console, Telegram or SMS written …
☆17Feb 17, 2026Updated 5 months ago
kenny-co / sgd-uap-torch
View on GitHub
Universal Adversarial Perturbations (UAPs) for PyTorch
☆50Aug 28, 2021Updated 4 years ago
agencyenterprise / PromptInject
View on GitHub
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…
☆510Apr 27, 2026Updated 2 months ago
ethz-spylab / autoadvexbench
View on GitHub
☆42May 21, 2025Updated last year
pratyushmaini / llm_dataset_inference
View on GitHub
Official Repository for Dataset Inference for LLMs
☆41Jul 25, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mbzuai-nlp / AudioJailbreak
View on GitHub
Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
☆32Oct 6, 2025Updated 9 months ago
CryptoAILab / Awesome-LM-SSP
View on GitHub
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆2,021Jun 17, 2026Updated last month
StavC / PromptWares
View on GitHub
A Jailbroken GenAI Model Can Cause Real Harm: GenAI-powered Applications are Vulnerable to PromptWares
☆15Aug 17, 2025Updated 11 months ago
dreadnode / research
View on GitHub
General research for Dreadnode
☆28Jun 17, 2024Updated 2 years ago
Sadcardation / ImageProtector
View on GitHub
Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…
☆19Apr 17, 2026Updated 3 months ago
researchcode001 / daca
View on GitHub
Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode
☆19Feb 16, 2025Updated last year
alkaet / LobotoMl
View on GitHub
LobotoMl is a set of scripts and tools to assess production deployments of ML services
☆10May 16, 2022Updated 4 years ago
Fitretech-Security / dylight
View on GitHub
macOS dylib stager
☆37Jan 22, 2025Updated last year
dropbox / bhakti
View on GitHub
Bundle of security analysis scripts for keras tensorflow models
☆16Apr 15, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
HanxunH / Detect-CLIP-Backdoor-Samples
View on GitHub
[ICLR2025] Detecting Backdoor Samples in Contrastive Language Image Pretraining
☆21Feb 26, 2025Updated last year
alienkeric / VisualStudio-RCE-EvilSln
View on GitHub
A New Exploitation Technique for Visual Studio Projects
☆13Nov 5, 2023Updated 2 years ago
microsoft / BIPIA
View on GitHub
A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.
☆147Apr 15, 2024Updated 2 years ago
DominicBreuker / SliverSamples
View on GitHub
A collection of sample code used in some experiments with Sliver C2
☆17Mar 28, 2023Updated 3 years ago
llm-attacks / llm-attacks
View on GitHub
Universal and Transferable Attacks on Aligned Language Models
☆4,740Aug 2, 2024Updated last year
JayGLXR / RustySpy
View on GitHub
A powerful Windows UI monitoring and DNS exfiltration tool written in Rust, combining advanced UI event capture capabilities with secure …
☆20Mar 6, 2025Updated last year
liyy201912 / PaS
View on GitHub
☆11May 3, 2022Updated 4 years ago