safellama / plexiglassLinks

A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).

☆151

Alternatives and similar repositories for plexiglass

Users that are interested in plexiglass are comparing it to the libraries listed below

Sorting:

ZenGuard-AI / fast-llm-security-guardrails
The fastest Trust Layer for AI Agents
☆145Updated 6 months ago
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆238Updated 9 months ago
fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆188Updated last year
deadbits / vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
☆430Updated last year
dropbox / llm-security
Dropbox LLM Security research code and results
☆245Updated last year
usnistgov / dioptra
Test Software for the Characterization of AI Technologies
☆265Updated this week
StavC / Here-Comes-the-AI-Worm
Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts Within GenAI Ecosystems
☆218Updated 2 months ago
JosephTLucas / jupysec
A JupyterLab extension to evaluate the security of your Jupyter environment
☆39Updated 2 years ago
dreadnode / rigging
Lightweight LLM Interaction Framework
☆394Updated last week
compl-ai / compl-ai
An open-source compliance-centered evaluation framework for Generative AI models
☆172Updated 2 weeks ago
google-research / camel-prompt-injection
Code for the paper "Defeating Prompt Injections by Design"
☆151Updated 5 months ago
haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆99Updated 7 months ago
agencyenterprise / PromptInject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…
☆437Updated last year
llmsecnet / llmsec-site
source for llmsec.net
☆16Updated last year
hwchase17 / adversarial-prompts
Curation of prompts that are known to be adversarial to large language models
☆186Updated 2 years ago
forcesunseen / llm-hackers-handbook
A guide to LLM hacking: fundamentals, prompt injection, offense, and defense
☆176Updated 2 years ago
wunderwuzzi23 / mlattacks
Machine Learning Attack Series
☆70Updated last year
agentfence / agentfence
AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection…
☆42Updated 8 months ago
Valhall-ai / prompt-injection-mitigations
A collection of prompt injection mitigation techniques.
☆24Updated 2 years ago
prompt-security / ps-fuzz
Make your GenAI Apps Safe & Secure Test & harden your system prompt
☆591Updated 2 months ago
mithril-security / blindbox
BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps
☆62Updated 2 years ago
RiccardoBiosas / awesome-MLSecOps
A curated list of MLSecOps tools, articles and other resources on security applied to Machine Learning and MLOps systems.
☆399Updated 3 months ago
lakeraai / pint-benchmark
A benchmark for prompt injection detection systems.
☆150Updated 3 months ago
corca-ai / LLMFuzzAgent
[Corca / ML] Automatically solved Gandalf AI with LLM
☆52Updated 2 years ago
sshh12 / llm_backdoor
Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…
☆191Updated last month
mik0w / pallms
Payloads for Attacking Large Language Models
☆106Updated 5 months ago
mitre-atlas / atlas-data
ATLAS tactics, techniques, and case studies data
☆88Updated this week
Giskard-AI / awesome-ai-safety
📚 A curated list of papers & technical articles on AI Quality & Safety
☆193Updated 7 months ago
mrwadams / honeyagents
HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …
☆58Updated last year
protectai / nbdefense
Secure Jupyter Notebooks and Experimentation Environment
☆84Updated 9 months ago