sinanw / llm-security-prompt-injectionLinks

This project investigates the security of large language models by performing binary classification of a set of input prompts to discover malicious prompts. Several approaches have been analyzed using classical ML algorithms, a trained LLM model, and a fine-tuned LLM model.

☆43

Alternatives and similar repositories for llm-security-prompt-injection

Users that are interested in llm-security-prompt-injection are comparing it to the libraries listed below

Sorting:

ZenGuard-AI / fast-llm-security-guardrails
The fastest Trust Layer for AI Agents
☆140Updated 2 months ago
lakeraai / pint-benchmark
A benchmark for prompt injection detection systems.
☆124Updated 2 weeks ago
deadbits / vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
☆400Updated last year
liu00222 / Open-Prompt-Injection
This repository provides a benchmark for prompt Injection attacks and defenses
☆250Updated 2 weeks ago
briland / LLM-security-and-privacy
LLM security and privacy
☆49Updated 9 months ago
Libr-AI / OpenRedTeaming
Papers about red teaming LLMs and Multimodal models.
☆130Updated 2 months ago
andyzorigin / cybench
☆127Updated last month
lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆113Updated last year
leondz / lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
☆32Updated last year
ethz-spylab / agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆226Updated last week
ThuCCSLab / JailbreakEval
[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
☆165Updated 4 months ago
patrickrchao / JailbreakingLLMs
☆588Updated last month
RiccardoBiosas / awesome-MLSecOps
A curated list of MLSecOps tools, articles and other resources on security applied to Machine Learning and MLOps systems.
☆340Updated this week
dropbox / llm-security
Dropbox LLM Security research code and results
☆231Updated last year
agencyenterprise / PromptInject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…
☆399Updated last year
Valhall-ai / prompt-injection-mitigations
A collection of prompt injection mitigation techniques.
☆23Updated last year
mik0w / pallms
Payloads for Attacking Large Language Models
☆92Updated 2 months ago
FonduAI / awesome-prompt-injection
Learn about a type of vulnerability that specifically targets machine learning models
☆314Updated last year
TrustAI-laboratory / LMAP
LMAP (large language model mapper) is like NMAP for LLM, is an LLM Vulnerability Scanner and Zero-day Vulnerability Fuzzer.
☆22Updated 9 months ago
uiuc-kang-lab / InjecAgent
☆70Updated last year
Babelscape / ALERT
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
☆44Updated 10 months ago
microsoft / BIPIA
A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.
☆73Updated last year
AIM-Intelligence / Automated-Multi-Turn-Jailbreaks
☆81Updated 8 months ago
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆203Updated 5 months ago
precize / Agentic-AI-Top10-Vulnerability
Top 10 for Agentic AI (AI Agent Security) serves as the core for OWASP and CSA Red teaming work
☆124Updated last month
tml-epfl / llm-adaptive-attacks
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]
☆324Updated 6 months ago
prompt-security / ps-fuzz
Make your GenAI Apps Safe & Secure Test & harden your system prompt
☆530Updated this week
mnns / LLMFuzzer
🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed …
☆303Updated last year
tldrsec / prompt-injection-defenses
Every practical and proposed defense against prompt injection.
☆503Updated 5 months ago
vinusankars / BEAST
Implementation of BEAST adversarial attack for language models (ICML 2024)
☆89Updated last year