google-research / camel-prompt-injection
Code for the paper "Defeating Prompt Injections by Design"
☆40 · Updated 3 weeks ago
Alternatives and similar repositories for camel-prompt-injection
Users interested in camel-prompt-injection are comparing it to the repositories listed below.
- ☆34 · Updated 8 months ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a… · ☆59 · Updated 4 months ago
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks. · ☆71 · Updated last year
- A repository of Language Model Vulnerabilities and Exposures (LVEs). · ☆112 · Updated last year
- Red-Teaming Language Models with DSPy · ☆202 · Updated 5 months ago
- ☆119 · Updated last month
- Codebase for "Obfuscated Activations Bypass LLM Latent-Space Defenses" · ☆21 · Updated 5 months ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives · ☆69 · Updated last year
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents. · ☆202 · Updated last week
- A prompt injection game to collect data for robust ML research · ☆62 · Updated 5 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. · ☆92 · Updated 3 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. · ☆26 · Updated 11 months ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models · ☆51 · Updated 10 months ago
- Implementation of the BEAST adversarial attack for language models (ICML 2024) · ☆88 · Updated last year
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". · ☆109 · Updated last year
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization" · ☆58 · Updated last week
- Dataset for the Tensor Trust project · ☆43 · Updated last year
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. · ☆113 · Updated last year
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on… · ☆41 · Updated 3 weeks ago
- ☆55 · Updated 9 months ago
- Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023] · ☆43 · Updated last year
- ☆90 · Updated last year
- ☆75 · Updated 7 months ago
- Accompanying code and SEP dataset for the paper "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" · ☆54 · Updated 4 months ago
- Sphynx Hallucination Induction · ☆53 · Updated 5 months ago
- CodeSage: Code Representation Learning At Scale (ICLR 2024) · ☆109 · Updated 8 months ago
- Whispers in the Machine: Confidentiality in Agentic Systems · ☆39 · Updated last month
- Code to break Llama Guard · ☆31 · Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces. · ☆39 · Updated this week
- ☆72 · Updated last week