ZenGuard-AI / fast-llm-security-guardrails
The fastest && easiest LLM security guardrails for AI Agents and applications.
☆101 · Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for fast-llm-security-guardrails
- Red-Teaming Language Models with DSPy ☆142 · Updated 7 months ago
- ☆34 · Updated 3 months ago
- A trace analysis tool for AI agents. ☆124 · Updated last month
- Framework for LLM evaluation, guardrails and security ☆96 · Updated 2 months ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024] ☆220 · Updated 2 months ago
- LLM security and privacy ☆41 · Updated last month
- Payloads for Attacking Large Language Models ☆64 · Updated 4 months ago
- ☆61 · Updated last month
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applications. ☆222 · Updated last month
- SecGPT: An execution isolation architecture for LLM-based systems ☆49 · Updated 3 weeks ago
- Risks and targets for assessing LLMs & LLM vulnerabilities ☆25 · Updated 5 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs). ☆107 · Updated 8 months ago
- Sphynx Hallucination Induction ☆48 · Updated 3 months ago
- Fiddler Auditor is a tool to evaluate language models. ☆171 · Updated 8 months ago
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use ☆115 · Updated 8 months ago
- This repository provides an implementation to formalize and benchmark prompt injection attacks and defenses ☆146 · Updated 2 months ago
- A benchmark for prompt injection detection systems. ☆87 · Updated 2 months ago
- Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs ☆184 · Updated 5 months ago
- Tutorial for building LLM router ☆163 · Updated 4 months ago
- A library for red-teaming LLM applications with LLMs. ☆22 · Updated last month
- Self-hardening firewall for large language models ☆258 · Updated 8 months ago
- ☆63 · Updated this week
- LLM | Security | Operations in one GitHub repo with good links and pictures. ☆19 · Updated last month
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆86 · Updated 5 months ago
- Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming" ☆33 · Updated 2 months ago
- Curation of prompts that are known to be adversarial to large language models ☆174 · Updated last year
- Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechanisms of LLMs. ☆18 · Updated 7 months ago
- TAP: An automated jailbreaking method for black-box LLMs ☆119 · Updated 8 months ago
- Papers about red teaming LLMs and multimodal models. ☆78 · Updated this week
- Official repo for Customized but Compromised: Assessing Prompt Injection Risks in User-Designed GPTs ☆21 · Updated last year