yibo-miao/T2VSafetyBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yibo-miao/T2VSafetyBench)

yibo-miao / T2VSafetyBench

☆25

Alternatives and similar repositories for T2VSafetyBench

Users that are interested in T2VSafetyBench are comparing it to the libraries listed below

Sorting:

ydc123 / MMP-Attack
View on GitHub
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆16Jul 15, 2024Updated last year
aiPenguin / StopReasoning
View on GitHub
☆14Oct 6, 2024Updated last year
zhangsn-19 / PAN
View on GitHub
Code and data for PAN and PAN-phys.
☆13Mar 20, 2023Updated 2 years ago
thu-ml / MMTrustEval
View on GitHub
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
☆174Jun 27, 2025Updated 8 months ago
tmllab / 2025_ICLR_PiF
View on GitHub
☆40May 17, 2025Updated 9 months ago
liuaishan / SpatiotemporalAttack
View on GitHub
☆13Dec 8, 2022Updated 3 years ago
datar001 / Awesome-AD-on-T2IDM
View on GitHub
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆94Dec 20, 2025Updated 2 months ago
AIM-Intelligence / Automated-Multi-Turn-Jailbreaks
View on GitHub
☆121Dec 3, 2025Updated 3 months ago
MaTengSYSU / HIMRD-jailbreak
View on GitHub
Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"
☆15Aug 7, 2025Updated 7 months ago
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆35Oct 23, 2024Updated last year
thu-ml / STAIR
View on GitHub
Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"
☆88Feb 26, 2025Updated last year
liuxuannan / Stochastic-Gradient-Aggregation
View on GitHub
Official implementation of the ICCV2023 paper: Enhancing Generalization of Universal Adversarial Perturbation through Gradient Aggregatio…
☆27Aug 17, 2023Updated 2 years ago
zhaoyiran924 / Probe-Sampling
View on GitHub
[NeurIPS 2024] Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling
☆34Nov 8, 2024Updated last year
HandingWangXDGroup / AGSM-DE
View on GitHub
An Approximated Gradient Sign Method Using Differential Evolution For Black-box Adversarial Attack
☆11Feb 25, 2022Updated 4 years ago
shighghyujie / newpatch-rl
View on GitHub
Simultaneously Optimizing Perturbations and Positions for Black-box Adversarial Patch Attacks (TPAMI 2022)
☆35Feb 9, 2023Updated 3 years ago
sani903 / OpenAgentSafety
View on GitHub
A Framework for Evaluating AI Agent Safety in Realistic Environments
☆30Oct 2, 2025Updated 5 months ago
TRLou / HiT-ADV
View on GitHub
The code of "Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds" CVPR 2024
☆36Mar 23, 2024Updated last year
YancyKahn / CoA
View on GitHub
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆39Jan 17, 2025Updated last year
konpanousis / Adversarial-LWTA-AutoAttack
View on GitHub
☆12May 6, 2022Updated 3 years ago
Aries-iai / TT3D
View on GitHub
The official implementation for "Towards Transferable Targeted 3D Adversarial Attack in the Physical World" (CVPR, 2024))
☆42Aug 6, 2024Updated last year
cure-lab / MMA-Diffusion
View on GitHub
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
☆386Jan 8, 2026Updated 2 months ago
isXinLiu / Awesome-MLLM-Safety
View on GitHub
Accepted by IJCAI-24 Survey Track
☆231Aug 25, 2024Updated last year
CryptoAILab / FigStep
View on GitHub
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆192Jun 26, 2025Updated 8 months ago
gq-max / AdvDiffVLM
View on GitHub
☆48Apr 7, 2025Updated 11 months ago
BSI-Bund / pySCASso
View on GitHub
☆12Jul 14, 2025Updated 7 months ago
Pi3AI / Ivy-Fake
View on GitHub
☆23Dec 11, 2025Updated 2 months ago
segev-shlomov / ST-WebAgentBench
View on GitHub
A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios
☆19Updated this week
ZuyiZhou / Awesome-Cross-modal-Reasoning-with-LLMs
View on GitHub
☆13Oct 21, 2024Updated last year
jacobocasado / PTHelper
View on GitHub
A penetration testing tool to help in Infrastructure pentesting process.
☆11Sep 19, 2023Updated 2 years ago
TrustAIRLab / HateBench
View on GitHub
[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆13Mar 1, 2025Updated last year
YitingQu / unsafe-diffusion
View on GitHub
☆46Jul 14, 2024Updated last year
verazuo / prompt-stealing-attack
View on GitHub
[USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models
☆51Jan 11, 2025Updated last year
isXinLiu / MM-SafetyBench
View on GitHub
Accepted by ECCV 2024
☆192Oct 15, 2024Updated last year
Huang-yihao / Personalization-based_backdoor
View on GitHub
☆10Dec 18, 2024Updated last year
UCLA-SEAL / DeepLearningTest
View on GitHub
Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)
☆10Sep 23, 2021Updated 4 years ago
peterwestuw / GPT2ForwardBackward
View on GitHub
Code for running forward and backward versions of GPT2
☆10Nov 20, 2021Updated 4 years ago
Trustworthy-AI-Group / BSR
View on GitHub
[CVPR 2024] Boosting Adversarial Transferability by Block Shuffle and Rotation
☆13Feb 28, 2024Updated 2 years ago
nehemya / Algo-Trade-Adversarial-Examples
View on GitHub
todo: desc
☆11Aug 12, 2021Updated 4 years ago
eth-sri / privacy-inference-multimodal
View on GitHub
☆20Feb 3, 2025Updated last year