Valhall-ai / prompt-injection-mitigations
A collection of prompt injection mitigation techniques.
☆22Updated last year
Alternatives and similar repositories for prompt-injection-mitigations
Users that are interested in prompt-injection-mitigations are comparing it to the libraries listed below
Sorting:
- Risks and targets for assessing LLMs & LLM vulnerabilities☆30Updated 11 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆108Updated last year
- using ML models for red teaming☆43Updated last year
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.☆23Updated last year
- ATLAS tactics, techniques, and case studies data☆71Updated 3 weeks ago
- Payloads for Attacking Large Language Models☆83Updated 10 months ago
- ☆37Updated 7 months ago
- Codebase of https://arxiv.org/abs/2410.14923☆47Updated 6 months ago
- Top 10 for Agentic AI (AI Agent Security)☆99Updated 2 months ago
- LLM security and privacy☆49Updated 7 months ago
- Data Scientists Go To Jupyter☆63Updated 2 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆109Updated 4 months ago
- [IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the vict…☆41Updated 2 months ago
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆86Updated last year
- https://arxiv.org/abs/2412.02776☆54Updated 5 months ago
- Dropbox LLM Security research code and results☆225Updated 11 months ago
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆52Updated last week
- LLM | Security | Operations in one github repo with good links and pictures.☆29Updated 4 months ago
- General research for Dreadnode☆23Updated 10 months ago
- A library to produce cybersecurity exploitation routes (exploit flows). Inspired by TensorFlow.☆35Updated last year
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆109Updated last year
- ☆65Updated 3 months ago
- ☆40Updated last week
- 🤖 A GitHub action that leverages fabric patterns through an agent-based approach☆26Updated 4 months ago
- This repository provides a benchmark for prompt Injection attacks and defenses☆196Updated 2 weeks ago
- LMAP (large language model mapper) is like NMAP for LLM, is an LLM Vulnerability Scanner and Zero-day Vulnerability Fuzzer.☆10Updated 7 months ago
- A benchmark for prompt injection detection systems.☆110Updated this week
- Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)☆11Updated 5 months ago
- ☆100Updated 2 months ago
- source code for the offsecml framework☆40Updated 11 months ago