针对大语言模型的对抗性攻击总结
☆38Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for Adversarial-Attacks-on-LLMs
Users that are interested in Adversarial-Attacks-on-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆179Oct 7, 2024Updated last year
- Iot-vulhub 自建镜像版☆12May 1, 2022Updated 3 years ago
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Jul 4, 2024Updated last year
- A Twitter monitoring tool powered by DeepSeek API and steel-browser, featuring AI translation/analysis, automatic screenshots, and multi-…☆12Jan 29, 2025Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆62Oct 21, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- GDB that can debug Mach-Os on Linux☆15Aug 11, 2017Updated 8 years ago
- ☆12Sep 29, 2024Updated last year
- Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"☆13Dec 4, 2025Updated 4 months ago
- a data collection of related work: Toward Understanding Deep Learning Framework Bugs☆18Oct 23, 2023Updated 2 years ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- ☆17Dec 2, 2025Updated 4 months ago
- 🚀 JailbreakBench 是一个用于评估大语言模型(LLM)安全性的测试工具,专注于检测模型对越狱攻击(Jailbreak)的抵抗能力。通过模拟恶意提示词注入、编码攻击和多轮对话操控,量化模型的漏洞风险,并生成详细报告与可视化分析。支持中英文数据集,适用于安全研究…☆31Sep 1, 2025Updated 7 months ago
- 实现对携程网站的酒店评论爬取,并进行数据预处理和基于情感分类的数据分析,使用了jieba评论分词等处理技术,情感词典,特征值提取,机器学习模型等分析预测技术,词云,热力图等可视化技术☆13Jul 15, 2022Updated 3 years ago
- Collections of powerful RL architectures with brief introductions.☆13Nov 20, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Jul 11, 2023Updated 2 years ago
- Attack to induce LLMs within hallucinations☆164May 17, 2024Updated last year
- [ICLR 2025] Official implementation of 'Hidden in the Noise: Two-Stage Robust Watermarking for Images'☆13May 5, 2025Updated 11 months ago
- Search-based Testing Approach of Reinforcement Learning Agent☆19Nov 25, 2024Updated last year
- Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024)☆18Oct 22, 2024Updated last year
- 🚜 METR: Message Enhanced Tree-Ring☆21Aug 19, 2024Updated last year
- SecProbe:任务驱动式大模型安全能力评测系统☆15Nov 29, 2024Updated last year
- 参考taviso的代码逆向一下mpengine.dll☆20Jun 30, 2022Updated 3 years ago
- The code for the paper titled as "DifAttack: Query-Efficient Black-Box Attack via Disentangled Feature Space".☆23Feb 10, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12Nov 19, 2022Updated 3 years ago
- go语言的24种设计模式☆13Jan 9, 2025Updated last year
- Watermarking papers☆17Mar 31, 2026Updated 2 weeks ago
- Client for the Toggl API built for async and await support☆19Jun 14, 2024Updated last year
- 7bits安全团队-《Java安全-记一次实战使用memoryshell》代码样例☆19Nov 13, 2022Updated 3 years ago
- ☆15Mar 10, 2025Updated last year
- ☆13Oct 6, 2022Updated 3 years ago
- 一个fuzzdb扩展库 弱密码和各语言网站后台/漏洞/备份文件路径☆13Feb 10, 2019Updated 7 years ago
- cr3 CTF 2024☆15May 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 一些用于互联网算法岗面试复习用的常见手撕代码合集:排序算法、最短路算法、二叉树遍历算法、sql语句、nms算法、IOU算法、多头注意力MHA等☆22Mar 18, 2025Updated last year
- 基于Spring Boot的迷你天猫商城,快速部署运行,所用技术:Spring Boot/MySQL/Druid/Log4j2/Maven/Echarts/Bootstrap☆14Aug 23, 2021Updated 4 years ago
- Data creation, training and eval scripts for the IRCoder paper☆21May 31, 2024Updated last year
- TRPO Implementation in Tensorflow 2.0 for Reinforcement Learning Project @ Sapienza☆16Mar 25, 2023Updated 3 years ago
- CVE-2024-35250 的 Beacon Object File (BOF) 实现。☆24Nov 28, 2024Updated last year
- ICCV 2021☆14Oct 6, 2021Updated 4 years ago
- A lightweight server for LightGBM☆15Oct 16, 2020Updated 5 years ago