Whispers in the Machine: Confidentiality in Agentic Systems
☆42Dec 11, 2025Updated 2 months ago
Alternatives and similar repositories for llm-confidentiality
Users that are interested in llm-confidentiality are comparing it to the libraries listed below
Sorting:
- Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the Over…☆13Aug 21, 2023Updated 2 years ago
- LLM security and privacy☆54Oct 15, 2024Updated last year
- Risks and targets for assessing LLMs & LLM vulnerabilities☆34May 27, 2024Updated last year
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …☆14Jan 16, 2026Updated last month
- ☆43May 23, 2023Updated 2 years ago
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆64Nov 10, 2025Updated 3 months ago
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models☆19Feb 18, 2025Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆57Aug 17, 2024Updated last year
- This repository provides a benchmark for prompt injection attacks and defenses in LLMs☆396Oct 29, 2025Updated 4 months ago
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.☆26May 16, 2024Updated last year
- LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins☆29Jul 29, 2024Updated last year
- Future version of the AnyBody Managed Model Repository with a full thoracic spine model.☆18Updated this week
- Papers and resources related to the security and privacy of LLMs 🤖☆570Jun 8, 2025Updated 9 months ago
- ☆30Oct 23, 2024Updated last year
- ☆11Feb 10, 2026Updated 3 weeks ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆26Jul 6, 2024Updated last year
- ☆29Aug 31, 2025Updated 6 months ago
- ☆25Feb 2, 2026Updated last month
- This is the source code for MEA-Defender. Our paper is accepted by the IEEE Symposium on Security and Privacy (S&P) 2024.☆29Nov 19, 2023Updated 2 years ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]☆379Jan 23, 2025Updated last year
- A collection of prompt injection mitigation techniques.☆27Aug 19, 2023Updated 2 years ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆116Jun 13, 2024Updated last year
- An improved version of Sublist3r, a python based Fast subdomains enumeration tool for penetration testers☆10Feb 10, 2024Updated 2 years ago
- A script that gives you the credentials of a Pterodactyl panel vulnerable to CVE-2025-49132☆17Jun 22, 2025Updated 8 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- [ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training☆11Sep 13, 2024Updated last year
- ☆10Dec 5, 2025Updated 3 months ago
- ☆86Sep 5, 2025Updated 6 months ago
- [CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge☆39Sep 17, 2025Updated 5 months ago
- ☆37Oct 17, 2024Updated last year
- This is the code for the paper "Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation".☆37Sep 1, 2025Updated 6 months ago
- This toolkit guides you on implementing secure and user-friendly digital and in-person interactions. Whether you are a service owner, dev…☆10Nov 6, 2025Updated 4 months ago
- ForgeRock Identity Cloud Debug Tools☆11Jan 27, 2023Updated 3 years ago
- Open library of musculoskeletal models and examples ready to be used with the AnyBody Modelling System.☆30Mar 2, 2026Updated last week
- You can use it to modify HTTP (S) response values, redirect static file requests to the local file directory, and support batch modificat…☆18Nov 30, 2022Updated 3 years ago
- Dropbox LLM Security research code and results☆255May 21, 2024Updated last year
- ☆701Jul 2, 2025Updated 8 months ago
- POC for CVE-2024-31982: XWiki Platform Remote Code Execution > 14.10.20☆10Jun 22, 2024Updated last year