Valhall-ai / prompt-injection-mitigationsLinks

A collection of prompt injection mitigation techniques.

☆23

Alternatives and similar repositories for prompt-injection-mitigations

Users that are interested in prompt-injection-mitigations are comparing it to the libraries listed below

Sorting:

leondz / lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
☆32Updated last year
precize / Agentic-AI-Top10-Vulnerability
Top 10 for Agentic AI (AI Agent Security) serves as the core for OWASP and CSA Red teaming work
☆124Updated last month
lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆113Updated last year
mitre-atlas / atlas-data
ATLAS tactics, techniques, and case studies data
☆77Updated 3 months ago
lakeraai / pint-benchmark
A benchmark for prompt injection detection systems.
☆124Updated 3 weeks ago
dreadnode / parley
Tree of Attacks (TAP) Jailbreaking Implementation
☆114Updated last year
dropbox / llm-security
Dropbox LLM Security research code and results
☆232Updated last year
subzer0girl2 / AI-Threat-Mind-Map
☆42Updated 7 months ago
Reapor-Yurnero / imprompter
Codebase of https://arxiv.org/abs/2410.14923
☆49Updated 9 months ago
deadbits / vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
☆402Updated last year
mik0w / pallms
Payloads for Attacking Large Language Models
☆92Updated 2 months ago
andyzorigin / cybench
☆130Updated last month
moohax / Charcuterie
Data Scientists Go To Jupyter
☆65Updated 5 months ago
protectai / nbdefense
Secure Jupyter Notebooks and Experimentation Environment
☆78Updated 6 months ago
xsankar / AI-Red-Teaming
All things specific to LLM Red Teaming Generative AI
☆28Updated 9 months ago
NickNameInvalid / LLM_CTF
☆65Updated 6 months ago
tldrsec / prompt-injection-defenses
Every practical and proposed defense against prompt injection.
☆503Updated 5 months ago
dreadnode / AIRTBench-Code
Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models
☆68Updated this week
invariantlabs-ai / mcp-injection-experiments
Code snippets to reproduce MCP tool poisoning attacks.
☆164Updated 3 months ago
pasquini-dario / LLMmap
☆51Updated 2 weeks ago
BishopFox / BrokenHill
A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)
☆123Updated 7 months ago
NetsecExplained / Attacking-and-Defending-Generative-AI
Reference notes for Attacking and Defending Generative AI presentation
☆64Updated last year
trailofbits / awesome-ml-security
☆139Updated 2 months ago
bsinger98 / Incalmo
☆45Updated this week
PalisadeResearch / intercode
https://arxiv.org/abs/2412.02776
☆59Updated 8 months ago
mnns / LLMFuzzer
🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed …
☆303Updated last year
5stars217 / offsecml
source code for the offsecml framework
☆41Updated last year
vinusankars / BEAST
Implementation of BEAST adversarial attack for language models (ICML 2024)
☆90Updated last year
timothee-chauvin / eyeballvul
future-proof vulnerability detection benchmark, based on CVEs in open-source repos
☆59Updated last week
ReversecLabs / spikee
☆61Updated 2 weeks ago