iliaishacked/sponge_examples

☆28

Alternatives and similar repositories for sponge_examples

Users who are interested in sponge_examples are comparing it to the repositories listed below.

Cinofix / sponge_poisoning_energy_latency_attack
Source code for the Energy-Latency Attacks via Sponge Poisoning paper.
☆15 · Mar 14, 2022 · Updated 3 years ago
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
Code for the ICLR 2025 paper "Failures to Find Transferable Image Jailbreaks Between Vision-Language Models"
☆37 · Jun 1, 2025 · Updated 9 months ago
WUSTL-CSPL / SlowLiDAR
☆12 · Dec 22, 2023 · Updated 2 years ago
Sandy-Zeng / NPAttack
PyTorch implementation of NPAttack
☆12 · Jul 7, 2020 · Updated 5 years ago
karandwivedi42 / adversarial
PyTorch - Adversarial Training
☆26 · May 9, 2018 · Updated 7 years ago
nickboucher / imperceptible
Bad Characters: Imperceptible NLP Attacks
☆35 · Apr 9, 2024 · Updated last year
chuhac / Reasoning-to-Defend
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12 · Aug 22, 2025 · Updated 6 months ago
KuofengGao / Verbose_Images
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆42 · Jan 25, 2024 · Updated 2 years ago
ydc123 / MMP-Attack
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆16 · Jul 15, 2024 · Updated last year
liuchen11 / AdversaryLossLandscape
On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]
☆36 · Jul 3, 2021 · Updated 4 years ago
declare-lab / ferret
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
☆18 · Aug 22, 2024 · Updated last year
kztakemoto / simbaja
All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks
☆18 · Apr 24, 2024 · Updated last year
Saehyung-Lee / cifar10_challenge
Code for the CVPR 2020 article "Adversarial Vertex Mixup: Toward Better Adversarially Robust Generalization"
☆13 · Jul 13, 2020 · Updated 5 years ago
weiyezhimeng / SQL-Injection-Jailbreak
☆21 · Jul 26, 2025 · Updated 7 months ago
winterwindwang / Full-coverage-camouflage-adversarial-attack
https://winterwindwang.github.io/Full-coverage-camouflage-adversarial-attack/
☆20 · May 9, 2022 · Updated 3 years ago
researchcode001 / daca
Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Model
☆18 · Feb 16, 2025 · Updated last year
yuplin2333 / representation-space-jailbreak
Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…
☆23 · Jul 26, 2024 · Updated last year
ASGuard-UCI / MSF-ADV
MSF-ADV is a novel physical-world adversarial attack method, which can fool the Multi Sensor Fusion (MSF) based autonomous driving (AD) p…
☆81 · Aug 4, 2021 · Updated 4 years ago
adnansirajrakin / TBT-CVPR2020
This repository provides sample code implementing the Targeted Bit Trojan attack.
☆20 · Nov 7, 2020 · Updated 5 years ago
pasquini-dario / LLM_NeuralExec
Code to generate NeuralExecs (prompt injection for LLMs)
☆27 · Oct 5, 2025 · Updated 4 months ago
dreadnode / research
General research for Dreadnode
☆27 · Jun 17, 2024 · Updated last year
val-iisc / GAMA-GAT
Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses, NeurIPS Spotlight 2020
☆27 · Dec 23, 2020 · Updated 5 years ago
snu-mllab / DiscreteBlockBayesAttack
Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…
☆25 · Sep 26, 2023 · Updated 2 years ago
bymavis / CAS_ICLR2021
☆58 · Jul 27, 2022 · Updated 3 years ago
unbiarirang / Fixed-Input-Parameterization
Official code for the paper "Prompt Injection: Parameterization of Fixed Inputs"
☆32 · Sep 13, 2024 · Updated last year
jiaxiaojunQAQ / FOA-Attack
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment (NeurIPS 2025)
☆49 · Nov 5, 2025 · Updated 3 months ago
sslab-gatech / RoboFuzz
Fuzzing framework for Robot Operating System (ROS) and ROS-based robotic systems
☆35 · Jul 7, 2025 · Updated 7 months ago
SheltonLiu-N / Universal-Prompt-Injection
The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".
☆69 · Oct 23, 2024 · Updated last year
SchwinnL / LLM_Embedding_Attack
Code to conduct an embedding attack on LLMs
☆31 · Jan 10, 2025 · Updated last year
ttbrunner / biased_boundary_attack
Implementation of the Biased Boundary Attack for ImageNet
☆22 · Aug 18, 2019 · Updated 6 years ago
OSU-NLP-Group / AgentSafety
☆178 · Oct 31, 2025 · Updated 4 months ago
P2333 / SCORE
A Self-Consistent Robust Error (ICML 2022)
☆69 · Jun 25, 2023 · Updated 2 years ago
SaFo-Lab / JailBreakV_28K
[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…
☆88 · May 9, 2025 · Updated 9 months ago
sleeepeer / PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆236 · Jan 27, 2026 · Updated last month
tmllab / 2025_ICLR_PiF
☆39 · May 17, 2025 · Updated 9 months ago
GreyDGL / ShareGPTs
☆34 · Dec 2, 2023 · Updated 2 years ago
boyellow / AdaAD
Code for the paper "Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation" (CVPR 2023).
☆34 · May 26, 2023 · Updated 2 years ago
shreyansh26 / Red-Teaming-Language-Models-with-Language-Models
A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022
☆35 · Oct 9, 2023 · Updated 2 years ago
ssg-research / dawn-dynamic-adversarial-watermarking-of-neural-networks
Watermarking against model extraction attacks in MLaaS. ACM MM 2021.
☆34 · Jul 15, 2021 · Updated 4 years ago