gitkolento / SecProbeLinks

SecProbe：任务驱动式大模型安全能力评测系统

☆14

Alternatives and similar repositories for SecProbe

Users that are interested in SecProbe are comparing it to the libraries listed below

Sorting:

ydyjya / Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide…
☆1,658Updated last week
STAIR-BUPT / STAIR-LLMGuardrails
☆12Updated last year
STAIR-BUPT / JailBench
JailBench：大型语言模型越狱攻击风险评测中文数据集 [PAKDD 2025]
☆135Updated 7 months ago
whitzard-ai / jade-db
"他山之石、可以攻玉"：复旦白泽智能发布面向国内开源和国外商用大模型的Demo数据集JADE-DB
☆468Updated last week
CryptoAILab / Awesome-LM-SSP
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆1,717Updated this week
WhitzardIndex / WhitzardBench-2024A
复旦白泽大模型安全基准测试集（2024年夏季版）
☆48Updated last year
sleeepeer / PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆210Updated 8 months ago
gitkolento / Adversarial-Attacks-on-LLMs
针对大语言模型的对抗性攻击总结
☆37Updated last year
DPamK / BadAgent
☆23Updated 8 months ago
sherdencooper / PromptFuzz
☆26Updated last year
xingjunm / Awesome-Large-Model-Safety
Safety at Scale: A Comprehensive Survey of Large Model Safety
☆200Updated 8 months ago
thu-coai / SafetyBench
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
☆260Updated 3 months ago
liu673 / Awesome-LLM4Security
This project aims to consolidate and share high-quality resources and tools across the cybersecurity domain.
☆266Updated 2 weeks ago
LiuYuHan31 / FPS
☆20Updated last year
thu-coai / ShieldLM
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]
☆214Updated last year
chichidd / llm-lora-trojan
Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"
☆25Updated last year
YancyKahn / CoA
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆37Updated 9 months ago
ltroin / llm_attack_defense_arena
☆82Updated last month
niconi19 / LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
☆106Updated last year
sherdencooper / GPTFuzz
Official repo for GPTFUZZER : Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
☆533Updated last year
NY1024 / Foundation-Model-Paper-Notes
☆66Updated 5 months ago
LLM-DRA / DRA
[USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise a…
☆109Updated last year
chawins / llm-sp
Papers and resources related to the security and privacy of LLMs 🤖
☆539Updated 4 months ago
ydyjya / LLM-IHS-Explanation
☆53Updated last year
agiresearch / ASB
Agent Security Bench (ASB)
☆137Updated 3 weeks ago
Tele-EVOL / AI-Governance
Global AI Safety and Governance: Never Compromise to Vulnerabilities
☆35Updated last month
AI45Lab / ActorAttack
☆108Updated 8 months ago
bboylyg / BackdoorLLM
[NeurIPS 2025] BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
☆226Updated last week
EasyJailbreak / EasyJailbreak
An easy-to-use Python framework to generate adversarial jailbreak prompts.
☆742Updated 7 months ago
theshi-1128 / llm-defense
An easy-to-use Python framework to defend against jailbreak prompts.
☆21Updated 7 months ago