An easy-to-use Python framework to defend against jailbreak prompts.
☆21Mar 22, 2025Updated last year
Alternatives and similar repositories for llm-defense
Users that are interested in llm-defense are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆22Mar 22, 2025Updated last year
- Red Queen Dataset and data generation template☆27Dec 26, 2025Updated 3 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆20Mar 25, 2024Updated 2 years ago
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023)☆12Dec 24, 2023Updated 2 years ago
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆14Jun 3, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [CVPR 2024] "Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training", Di Ming, Peng Ren, Yunlong Wang, Xin …☆16Jun 11, 2024Updated last year
- BASAR:Black-box Attack on Skeletal Action Recognition, CVPR 2021☆19Feb 18, 2025Updated last year
- Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'☆24Jun 9, 2024Updated last year
- ☆13Nov 11, 2022Updated 3 years ago
- ☆13Mar 29, 2021Updated 5 years ago
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Jun 12, 2023Updated 2 years ago
- A Unified Approach to Interpreting and Boosting Adversarial Transferability (ICLR2021)☆31Apr 22, 2022Updated 3 years ago
- Some vulnerability research slides that I made☆12Jan 5, 2022Updated 4 years ago
- my poc☆16Oct 28, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ICL backdoor attack☆17Nov 4, 2024Updated last year
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆27Jul 6, 2024Updated last year
- ☆13Dec 22, 2017Updated 8 years ago
- Mobile Security - OMTG-Android Walkthrough☆11Oct 31, 2019Updated 6 years ago
- ☆34Nov 12, 2024Updated last year
- URL-encode data streams via commandline☆14Oct 26, 2019Updated 6 years ago
- Tool to get the top android apps for bug bounty purpose☆17Sep 10, 2020Updated 5 years ago
- Code and dataset for the paper: "Can Editing LLMs Inject Harm?"☆21Dec 26, 2025Updated 3 months ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Oct 4, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A substitute repository put up on public demand for the original Awesome WAF repository (https://github.com/0xInfection/Awesome-WAF) whic…☆12May 3, 2019Updated 6 years ago
- SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references fro…☆15Sep 14, 2025Updated 6 months ago
- Some exploits to bypass Safer Mode in Ghostscript☆18Oct 24, 2020Updated 5 years ago
- Push notifications to Slack channel or to custom server based on BurpSuite response conditions.☆17Nov 26, 2020Updated 5 years ago
- some codeql rules☆15Apr 6, 2020Updated 6 years ago
- ☆55Dec 7, 2024Updated last year
- A python library to build graphs for programs written in different programming languages.☆13May 6, 2022Updated 3 years ago
- 毕业设计。Keywords: 层次聚类、谱聚类、WordNet☆10Jun 29, 2014Updated 11 years ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A curated list of 180+ useful Claude Skills for Software Engineering and resources for customizing AI for SE workflows.☆66Feb 22, 2026Updated last month
- 常见漏洞环境dockerfile☆18Jan 13, 2021Updated 5 years ago
- Static code search python lib☆18Oct 27, 2020Updated 5 years ago
- sshchecker is a fast dedicated ssh brute-forcing tool to check ssh login on the giving IP list.☆23Feb 8, 2026Updated 2 months ago
- CircleCI log and security configuration automations☆22Sep 20, 2020Updated 5 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- Fine-tuning base models to build robust task-specific models☆35Apr 11, 2024Updated last year