LostOxygen / llm-confidentiality
View external linksLinks

Whispers in the Machine: Confidentiality in Agentic Systems

☆41

Alternatives and similar repositories for llm-confidentiality

Users that are interested in llm-confidentiality are comparing it to the libraries listed below

Sorting:

ZiyueWang25 / llm-security-challenge
View on GitHub
Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the Over…
☆13Aug 21, 2023Updated 2 years ago
briland / LLM-security-and-privacy
View on GitHub
LLM security and privacy
☆53Oct 15, 2024Updated last year
leondz / lm_risk_cards
View on GitHub
Risks and targets for assessing LLMs & LLM vulnerabilities
☆33May 27, 2024Updated last year
ruyimarone / data-portraits
View on GitHub
Documenting large text datasets 🖼️ 📚
☆14Dec 17, 2024Updated last year
levitation-opensource / Manipulative-Expression-Recognition
View on GitHub
MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …
☆14Jan 16, 2026Updated last month
Sizhe-Chen / StruQ
View on GitHub
official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries
☆63Nov 10, 2025Updated 3 months ago
microsoft / purview
View on GitHub
☆23Feb 2, 2026Updated 2 weeks ago
tianshuocong / TePA
View on GitHub
[S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models
☆19Feb 18, 2025Updated 11 months ago
chawins / pal
View on GitHub
PAL: Proxy-Guided Black-Box Attack on Large Language Models
☆57Aug 17, 2024Updated last year
liu00222 / Open-Prompt-Injection
View on GitHub
This repository provides a benchmark for prompt injection attacks and defenses in LLMs
☆391Oct 29, 2025Updated 3 months ago
hupe1980 / aisploit
View on GitHub
🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.
☆26May 16, 2024Updated last year
llm-platform-security / chatgpt-plugin-eval
View on GitHub
LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins
☆29Jul 29, 2024Updated last year
chawins / llm-sp
View on GitHub
Papers and resources related to the security and privacy of LLMs 🤖
☆561Jun 8, 2025Updated 8 months ago
sherdencooper / PromptFuzz
View on GitHub
☆29Oct 23, 2024Updated last year
wegodev2 / virtual-prompt-injection
View on GitHub
Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"
☆27Jul 6, 2024Updated last year
lvpeizhuo / MEA-Defender
View on GitHub
This is the source code for MEA-Defender. Our paper is accepted by the IEEE Symposium on Security and Privacy (S&P) 2024.
☆29Nov 19, 2023Updated 2 years ago
tml-epfl / llm-adaptive-attacks
View on GitHub
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]
☆377Jan 23, 2025Updated last year
ethz-spylab / rlhf_trojan_competition
View on GitHub
Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.
☆116Jun 13, 2024Updated last year
harekrishnarai / flowlyt
View on GitHub
Flowlyt is a security analyzer that scans GitHub Actions workflows to detect malicious patterns, misconfigurations, and secrets exposure,…
☆15Feb 10, 2026Updated last week
JorianWoltjer / nodejs-file-write-rce
View on GitHub
NodeJS File Write to RCE on a read-only filesystem using a ROP chain in libuv
☆37Oct 13, 2024Updated last year
VimalWill / Vstream
View on GitHub
Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)
☆10Feb 2, 2024Updated 2 years ago
Zen-kun04 / CVE-2025-49132
View on GitHub
A script that gives you the credentials of a Pterodactyl panel vulnerable to CVE-2025-49132
☆16Jun 22, 2025Updated 7 months ago
VMnK-Run / MARVEL
View on GitHub
[ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training
☆11Sep 13, 2024Updated last year
ltroin / llm_attack_defense_arena
View on GitHub
☆86Sep 5, 2025Updated 5 months ago
ShiJiawenwen / JudgeDeceiver
View on GitHub
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
☆39Sep 17, 2025Updated 5 months ago
Lyz1213 / BadEdit
View on GitHub
☆37Oct 17, 2024Updated last year
lyc8503 / FM1208_scripts
View on GitHub
Scripts & Code patches for analyzing/emulating/copying FM1208 CPU Cards (读取复制 SAK28 CPU卡 FM1208)
☆19Mar 7, 2025Updated 11 months ago
bcgov / digital-trust-toolkit
View on GitHub
This toolkit guides you on implementing secure and user-friendly digital and in-person interactions. Whether you are a service owner, dev…
☆10Nov 6, 2025Updated 3 months ago
GenTelLab / trustclaw
View on GitHub
☆39Feb 9, 2026Updated last week
DequanWang / dent
View on GitHub
Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks
☆38May 25, 2021Updated 4 years ago
vscheuber / fidc-debug-tools
View on GitHub
ForgeRock Identity Cloud Debug Tools
☆11Jan 27, 2023Updated 3 years ago
netskopeoss / ta_cloud_exchange
View on GitHub
☆14Jan 27, 2026Updated 3 weeks ago
dropbox / llm-security
View on GitHub
Dropbox LLM Security research code and results
☆254May 21, 2024Updated last year
patrickrchao / JailbreakingLLMs
View on GitHub
☆696Jul 2, 2025Updated 7 months ago
ineveLoppiliF / Online-Isolation-Forest
View on GitHub
☆13Jan 16, 2025Updated last year
mogland / console
View on GitHub
🕹️ Powerful Web Console for administers and root｜为管理员制作的网页端博客控制台
☆10Feb 2, 2026Updated 2 weeks ago
msrofficial / WiFuX
View on GitHub
A powerful tool for hacking WiFi using termux.
☆27Updated this week
futureHQ / FutureGPT
View on GitHub
⚡ FutureGPT - Application development framework that connects GPT-4 with external data, the internet, other applications and language mod…
☆12May 14, 2023Updated 2 years ago
Dwolla / bambot
View on GitHub
BambooHR Slack bot publishes company holidays, work anniversaries, first days, birthdays, and who is out each weekday.
☆10Mar 4, 2023Updated 2 years ago

LostOxygen / llm-confidentialityView external linksLinks

Alternatives and similar repositories for llm-confidentiality

LostOxygen / llm-confidentiality
View external linksLinks