gitkolento / Adversarial-Attacks-on-LLMs
A summary of adversarial attacks against large language models
☆16 · Updated last year
Alternatives and similar repositories for Adversarial-Attacks-on-LLMs:
Users interested in Adversarial-Attacks-on-LLMs are comparing it to the libraries listed below.
- ☆77 · Updated 9 months ago
- This GitHub repository summarizes a list of research papers on AI security from the four top academic conferences. ☆105 · Updated last year
- [USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise a… ☆62 · Updated 3 months ago
- Simple PyTorch implementations of BadNets on MNIST and CIFAR10. ☆167 · Updated 2 years ago
- Official code for our NDSS paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarkin… ☆25 · Updated 2 months ago
- ☆33 · Updated last month
- ☆23 · Updated 4 months ago
- ☆26 · Updated last month
- ☆16 · Updated 3 months ago
- AI Model Security Reading Notes ☆35 · Updated 6 months ago
- Red Queen Dataset and data generation template ☆10 · Updated 3 months ago
- A curated list of papers & resources on backdoor attacks and defenses in deep learning. ☆192 · Updated 10 months ago
- The most comprehensive and accurate LLM jailbreak attack benchmark to date ☆13 · Updated 2 months ago
- Repository for "Towards Codable Watermarking for Large Language Models" ☆34 · Updated last year
- This is the source code for Data-free Backdoor. Our paper was accepted at the 32nd USENIX Security Symposium (USENIX Security 2023). ☆31 · Updated last year
- Invisible Backdoor Attack with Sample-Specific Triggers ☆93 · Updated 2 years ago
- ☆15 · Updated 2 weeks ago
- MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks… ☆18 · Updated 4 months ago
- Source code and scripts for the paper "Is Difficulty Calibration All We Need? Towards More Practical Membership Inference Attacks" ☆15 · Updated last month
- A curated list of papers & resources on data poisoning, backdoor attacks, and defenses against them (no longer maintained) ☆219 · Updated 2 weeks ago
- The automated prompt injection framework for LLM-integrated applications. ☆179 · Updated 4 months ago
- 😎 An up-to-date, curated list of awesome papers, methods & resources on attacks against Large Vision-Language Models. ☆193 · Updated 3 weeks ago
- Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks (IEEE S&P 2024) ☆34 · Updated 10 months ago
- AdvDoor: Adversarial Backdoor Attack of Deep Learning System ☆32 · Updated 2 months ago
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models ☆111 · Updated 3 months ago
- ☆49 · Updated last month
- ☆12 · Updated last year
- ☆24 · Updated 3 years ago
- A reproduction of the Neural Cleanse paper; genuinely simple and effective. Posted on okaland. ☆30 · Updated 3 years ago