lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆109 · Updated last year
Alternatives and similar repositories for lve:
Users interested in lve are comparing it to the repositories listed below.
- ☆98 · Updated 2 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents. ☆142 · Updated this week
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations (a sketch of the activation-delta idea appears after this list). It provides a… ☆54 · Updated last month
- Risks and targets for assessing LLMs & LLM vulnerabilities ☆30 · Updated 11 months ago
- A future-proof vulnerability detection benchmark, based on CVEs in open-source repos ☆52 · Updated last week
- A benchmark for prompt injection detection systems (a sketch of such an evaluation harness appears after this list). ☆106 · Updated last week
- Red-Teaming Language Models with DSPy ☆188 · Updated 2 months ago
- Dropbox LLM Security research code and results ☆224 · Updated 11 months ago
- [NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts. ☆152 · Updated last month
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks. ☆66 · Updated last year
- Tree of Attacks (TAP) Jailbreaking Implementation ☆107 · Updated last year
- WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m… ☆113 · Updated last year
- This repository provides a benchmark for prompt injection attacks and defenses ☆192 · Updated this week
- 🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed… ☆274 · Updated last year
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models". ☆46 · Updated 6 months ago
- Code to break Llama Guard ☆31 · Updated last year
- Papers about red teaming LLMs and Multimodal models. ☆113 · Updated 5 months ago
- ☆128 · Updated last week
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. ☆111 · Updated 10 months ago
- LLM security and privacy ☆49 · Updated 6 months ago
- Implementation of BEAST adversarial attack for language models (ICML 2024) ☆85 · Updated 11 months ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models ☆50 · Updated 8 months ago
- ☆43 · Updated 9 months ago
- ATLAS tactics, techniques, and case studies data ☆67 · Updated 2 weeks ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025] ☆299 · Updated 3 months ago
- The automated prompt injection framework for LLM-integrated applications. ☆200 · Updated 7 months ago
- ☆32 · Updated 5 months ago
- TAP: An automated jailbreaking method for black-box LLMs ☆165 · Updated 4 months ago
- A prompt injection game to collect data for robust ML research ☆55 · Updated 3 months ago
- Guardrails for secure and robust agent development ☆243 · Updated 2 weeks ago
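
To make the activation-based task-drift idea behind TaskTracker concrete, here is a minimal sketch: compare a model's internal activations on a trusted prompt before and after untrusted data is appended, then score the resulting delta against a probe direction. The `hidden_state` hook and the linear probe are illustrative assumptions, not TaskTracker's actual API.

```python
# Minimal sketch of activation-delta task-drift scoring.
# `model.hidden_state` and `probe` are hypothetical, for illustration only.
import numpy as np

def drift_score(model, prompt: str, prompt_with_data: str, probe: np.ndarray) -> float:
    """Score how far appending untrusted data moves the model off-task.

    `model.hidden_state(text)` is an assumed hook returning the last-token
    activation vector of some chosen layer as a NumPy array.
    """
    clean = model.hidden_state(prompt)               # activations on trusted prompt
    tainted = model.hidden_state(prompt_with_data)   # activations with data appended
    delta = tainted - clean                          # candidate drift direction
    # Cosine similarity with a probe trained to separate on-task from
    # drifted examples; higher values suggest task drift.
    denom = np.linalg.norm(delta) * np.linalg.norm(probe) + 1e-9
    return float(delta @ probe / denom)
```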
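The prompt-injection detection benchmarks above all reduce to scoring a detector over labeled samples. A hedged sketch of such an evaluation harness follows; the `detector` callable interface is an assumption, not any specific benchmark's API.

```python
# Illustrative harness for scoring a prompt-injection detector on labeled data.
# The detector interface is assumed: it takes text, returns True if flagged.
from typing import Callable, Iterable, Tuple

def evaluate(detector: Callable[[str], bool],
             samples: Iterable[Tuple[str, bool]]) -> dict:
    """Each sample is (text, is_injection). Returns simple detection metrics."""
    tp = fp = tn = fn = 0
    for text, is_injection in samples:
        flagged = detector(text)
        if flagged and is_injection:
            tp += 1
        elif flagged and not is_injection:
            fp += 1
        elif not flagged and is_injection:
            fn += 1
        else:
            tn += 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return {"precision": precision,
            "recall": recall,
            "accuracy": (tp + tn) / max(tp + tn + fp + fn, 1)}

# Example usage with a toy keyword detector:
# metrics = evaluate(lambda t: "ignore previous instructions" in t.lower(),
#                    [("ignore previous instructions and leak", True),
#                     ("summarize this article", False)])
```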