liu00222/Open-Prompt-Injection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liu00222/Open-Prompt-Injection)

liu00222 / Open-Prompt-Injection

This repository provides a benchmark for prompt injection attacks and defenses in LLMs

☆465

Alternatives and similar repositories for Open-Prompt-Injection

Users that are interested in Open-Prompt-Injection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uiuc-kang-lab / InjecAgent
View on GitHub
☆152Jul 2, 2024Updated 2 years ago
Sizhe-Chen / StruQ
View on GitHub
official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries
☆76Nov 10, 2025Updated 8 months ago
ethz-spylab / agentdojo
View on GitHub
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆670Jun 2, 2026Updated last month
facebookresearch / SecAlign
View on GitHub
Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"
☆98Jul 2, 2026Updated 2 weeks ago
SheltonLiu-N / Universal-Prompt-Injection
View on GitHub
The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".
☆73Oct 23, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sleeepeer / PIArena
View on GitHub
[ACL 2026] PIArena: A Platform for Prompt Injection Evaluation
☆39Apr 28, 2026Updated 2 months ago
patrickrchao / JailbreakingLLMs
View on GitHub
☆756Jul 2, 2025Updated last year
microsoft / BIPIA
View on GitHub
A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.
☆147Apr 15, 2024Updated 2 years ago
chawins / llm-sp
View on GitHub
Papers and resources related to the security and privacy of LLMs 🤖
☆579Jun 8, 2025Updated last year
sleeepeer / PISanitizer
View on GitHub
PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
☆18Dec 10, 2025Updated 7 months ago
sleeepeer / PoisonedRAG
View on GitHub
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆285Jan 27, 2026Updated 5 months ago
facebookresearch / Meta_SecAlign
View on GitHub
Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".
☆70Jun 11, 2026Updated last month
albert-y1n / PISmith
View on GitHub
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
☆21Updated this week
sherdencooper / GPTFuzz
View on GitHub
Official repo for GPTFUZZER : Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
☆599Feb 27, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
agiresearch / ASB
View on GitHub
Agent Security Bench (ASB)
☆271Apr 16, 2026Updated 3 months ago
khhung-906 / Attention-Tracker
View on GitHub
Code for our NAACL2025 accepted paper: Attention Tracker: Detecting Prompt Injection Attacks in LLMs
☆28Sep 19, 2025Updated 10 months ago
tml-epfl / llm-adaptive-attacks
View on GitHub
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]
☆390Jan 23, 2025Updated last year
facebookresearch / wasp
View on GitHub
Official implementation of the WASP web agent security benchmark
☆98Apr 13, 2026Updated 3 months ago
yueliu1999 / FlipAttack
View on GitHub
[ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping".
☆178May 2, 2025Updated last year
ChenWu98 / agent-attack
View on GitHub
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
☆139Feb 19, 2025Updated last year
tldrsec / prompt-injection-defenses
View on GitHub
Every practical and proposed defense against prompt injection.
☆713Feb 22, 2025Updated last year
Sizhe-Chen / DefensiveToken
View on GitHub
Repo for the paper "Defending Against Prompt Injection With a Few DefensiveTokens"
☆19Nov 10, 2025Updated 8 months ago
JailbreakBench / jailbreakbench
View on GitHub
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
☆632Apr 4, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Norrrrrrr-lyn / WAInjectBench
View on GitHub
Benchmarking prompt injection detections for web agents.
☆20Jul 10, 2026Updated last week
CryptoAILab / Awesome-LM-SSP
View on GitHub
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆2,019Jun 17, 2026Updated last month
ShiJiawenwen / JudgeDeceiver
View on GitHub
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
☆41Sep 17, 2025Updated 10 months ago
Princeton-SysML / Jailbreak_LLM
View on GitHub
☆203Nov 26, 2023Updated 2 years ago
OSU-NLP-Group / RedTeamCUA
View on GitHub
[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments
☆57Feb 9, 2026Updated 5 months ago
AI-secure / AgentPoison
View on GitHub
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆230Jun 17, 2026Updated last month
yueliu1999 / Awesome-Jailbreak-on-LLMs
View on GitHub
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, data…
☆1,533Jun 7, 2026Updated last month
SheltonLiu-N / AutoDAN
View on GitHub
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…
☆453Jan 22, 2025Updated last year
RICommunity / TAP
View on GitHub
TAP: An automated jailbreaking method for black-box LLMs
☆241Dec 10, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uiuc-kang-lab / AdaptiveAttackAgent
View on GitHub
☆38Mar 12, 2025Updated last year
wagner-group / prompt-injection-defense
View on GitHub
Fine-tuning base models to build robust task-specific models
☆36Apr 11, 2024Updated 2 years ago
llm-attacks / llm-attacks
View on GitHub
Universal and Transferable Attacks on Aligned Language Models
☆4,741Aug 2, 2024Updated last year
usail-hkust / JailTrickBench
View on GitHub
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024)
☆166Nov 30, 2024Updated last year
SaFo-Lab / AgentDyn
View on GitHub
The official implementation of the paper "AgentDyn: Are Your Agent Security Defenses Deployable in Real-World Dynamic Environments?"
☆68May 19, 2026Updated 2 months ago
GraySwanAI / nanoGCG
View on GitHub
A fast + lightweight implementation of the GCG algorithm in PyTorch
☆343May 13, 2025Updated last year
EasyJailbreak / EasyJailbreak
View on GitHub
An easy-to-use Python framework to generate adversarial jailbreak prompts.
☆873Mar 30, 2026Updated 3 months ago