ATIpiu / SafeGenInjectLinks

全球AI攻防挑战赛—赛道一：大模型生图安全疫苗注入第二名解题方案

☆24

Alternatives and similar repositories for SafeGenInject

Users that are interested in SafeGenInject are comparing it to the libraries listed below

Sorting:

xingjunm / Awesome-Large-Model-Safety
Safety at Scale: A Comprehensive Survey of Large Model Safety
☆190Updated 7 months ago
LLM-DRA / DRA
[USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise a…
☆106Updated 11 months ago
isXinLiu / Awesome-MLLM-Safety
Accepted by IJCAI-24 Survey Track
☆215Updated last year
shenyizg / NewAdversarialAttackPaper
A list of recent adversarial attack and defense papers (including those on large language models)
☆43Updated last week
liudaizong / Awesome-LVLM-Attack
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆388Updated this week
BillChan226 / SafeWatch
[ICLR 2025] Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanati…
☆37Updated 7 months ago
liuxuannan / Awesome-Multimodal-Jailbreak
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
☆228Updated 3 weeks ago
yibo-miao / T2VSafetyBench
☆20Updated 10 months ago
ThuCCSLab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆171Updated 3 months ago
STAIR-BUPT / JailBench
JailBench：大型语言模型越狱攻击风险评测中文数据集 [PAKDD 2025]
☆128Updated 6 months ago
VILA-Lab / M-Attack
[NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the…
☆70Updated 5 months ago
chen37058 / Red-Team-Arxiv-Paper-Update
Awesome Jailbreak, red teaming arxiv papers (Automatically Update Every 12th hours)
☆63Updated this week
shuita2333 / AutoDoS
Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings
☆15Updated 3 weeks ago
selfdefend / Code
☆25Updated 8 months ago
Unispac / Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
☆238Updated last year
thu-ml / MMTrustEval
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
☆166Updated 3 months ago
aiPenguin / StopReasoning
☆12Updated 11 months ago
SproutNan / AI-Safety_Benchmark
The official repository for guided jailbreak benchmark
☆19Updated last month
Daisy-Zhang / Awesome-AIGC-Detection
A collection list of AIGC detection related papers.
☆128Updated 11 months ago
WUSTL-CSPL / LLMJailbreak
☆36Updated 11 months ago
sherdencooper / PromptFuzz
☆26Updated 11 months ago
PKU-YuanGroup / Hallucination-Attack
Attack to induce LLMs within hallucinations
☆157Updated last year
zhrli324 / Corba
☆11Updated 4 months ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆50Updated last year
thu-coai / ShieldLM
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]
☆211Updated 11 months ago
sail-sg / P-DoS
[ArXiv 2024] Denial-of-Service Poisoning Attacks on Large Language Models
☆21Updated 11 months ago
ai-data-model-safety / ai-data-model-safety.github.io
☆40Updated 9 months ago
ltroin / llm_attack_defense_arena
☆82Updated 3 weeks ago
tmlr-group / DeepInception
[arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker"
☆162Updated last year
Haochen-Luo / CroPA
☆48Updated 9 months ago