llm-platform-security / chatgpt-plugin-evalLinks

LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins

☆28

Alternatives and similar repositories for chatgpt-plugin-eval

Users that are interested in chatgpt-plugin-eval are comparing it to the libraries listed below

Sorting:

eth-sri / sven
☆124Updated last year
PurduePAML / Machine-Learning-Security-Seminar
Machine Learning & Security Seminar @Purdue University
☆25Updated 2 years ago
iliaishacked / sponge_examples
☆25Updated 4 years ago
briland / LLM-security-and-privacy
LLM security and privacy
☆52Updated last year
s2e-lab / SecurityEval
Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis…
☆82Updated 2 years ago
surrealyz / verified-global-properties
Learning Security Classifiers with Verified Global Robustness Properties (CCS'21) https://arxiv.org/pdf/2105.11363.pdf
☆28Updated 3 years ago
BHui97 / PLeak
☆69Updated 11 months ago
liu00222 / Open-Prompt-Injection
This repository provides a benchmark for prompt injection attacks and defenses
☆346Updated last month
facebookresearch / SecAlign
Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"
☆75Updated 4 months ago
tuhh-softsec / LLMSecEval
☆54Updated last year
llm-platform-security / SecGPT
An Execution Isolation Architecture for LLM-Based Agentic Systems
☆100Updated 9 months ago
reza321 / T-Miner
☆19Updated last year
LostOxygen / llm-confidentiality
Whispers in the Machine: Confidentiality in Agentic Systems
☆41Updated 3 weeks ago
Sizhe-Chen / StruQ
official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries
☆52Updated 2 weeks ago
microsoft / analysing_pii_leakage
The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…
☆100Updated last year
AI-secure / AgentPoison
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆168Updated 7 months ago
niuliang42 / CodexLeaks
CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot
☆11Updated 2 years ago
wagner-group / prompt-injection-defense
Fine-tuning base models to build robust task-specific models
☆34Updated last year
agiresearch / ASB
Agent Security Bench (ASB)
☆147Updated last month
ZhangZhuoSJTU / LINT
☆17Updated last year
SolidShen / BAIT
🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access
☆46Updated 5 months ago
chawins / pal
PAL: Proxy-Guided Black-Box Attack on Large Language Models
☆55Updated last year
byerose / Awesome-Foundation-Model-Security
A curated list of trustworthy Generative AI papers. Daily updating...
☆75Updated last year
Testing4AI / DeepJudge
Code release for DeepJudge (S&P'22)
☆52Updated 2 years ago
CryptoAILab / JailbreakEval
[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
☆172Updated 7 months ago
datasec-lab / CodeBreaker
[USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…
☆53Updated 8 months ago
xirui-li / DrAttack
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
☆66Updated last year
ClonedOne / MalwareBackdoors
Code for the paper Explanation-Guided Backdoor Poisoning Attacks Against Malware Classifiers
☆59Updated 3 years ago
arobey1 / smooth-llm
☆112Updated 2 years ago
ethz-spylab / agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆357Updated last month