☆29Aug 31, 2025Updated 6 months ago
Alternatives and similar repositories for LeakAgent
Users that are interested in LeakAgent are comparing it to the libraries listed below
Sorting:
- Code to generate NeuralExecs (prompt injection for LLMs)☆27Oct 5, 2025Updated 5 months ago
- ☆14Mar 9, 2025Updated 11 months ago
- ☆27Oct 6, 2024Updated last year
- ☆29Oct 23, 2024Updated last year
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- ☆28Sep 11, 2025Updated 5 months ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆81Sep 1, 2025Updated 6 months ago
- Distribution Preserving Backdoor Attack in Self-supervised Learning☆20Jan 27, 2024Updated 2 years ago
- ☆33Mar 13, 2025Updated 11 months ago
- ☆39May 19, 2023Updated 2 years ago
- ☆20Feb 11, 2024Updated 2 years ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆33Aug 4, 2025Updated 7 months ago
- This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…☆19Jun 7, 2023Updated 2 years ago
- ☆52Feb 8, 2025Updated last year
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 2 years ago
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models☆27Mar 15, 2025Updated 11 months ago
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆130Feb 19, 2025Updated last year
- ☆70Feb 4, 2024Updated 2 years ago
- ☆71Feb 16, 2025Updated last year
- Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"☆26Jan 7, 2022Updated 4 years ago
- This is the source code for MEA-Defender. Our paper is accepted by the IEEE Symposium on Security and Privacy (S&P) 2024.☆29Nov 19, 2023Updated 2 years ago
- Code for "Adversarial Illusions in Multi-Modal Embeddings"☆31Aug 4, 2024Updated last year
- Agent Security Bench (ASB)☆186Oct 27, 2025Updated 4 months ago
- ☆31Jul 14, 2023Updated 2 years ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆454Feb 3, 2026Updated last month
- ☆37Oct 17, 2024Updated last year
- ☆301Jan 13, 2026Updated last month
- ☆37Sep 30, 2024Updated last year
- ☆37Dec 19, 2024Updated last year
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)☆96Jan 20, 2025Updated last year
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆36Oct 15, 2023Updated 2 years ago
- Code for the paper "Defeating Prompt Injections by Design"☆268Jun 20, 2025Updated 8 months ago
- Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks (IEEE S&P 2024)☆34Jun 29, 2025Updated 8 months ago
- [CCS 2021] "DataLens: Scalable Privacy Preserving Training via Gradient Compression and Aggregation" by Boxin Wang*, Fan Wu*, Yunhui Long…☆36Dec 28, 2021Updated 4 years ago
- Alcatel-Lucent SR/OS (TiMOS) Firmware Packer☆11May 31, 2017Updated 8 years ago
- Smart Sessions with EIP-7702 EOA☆16Nov 15, 2024Updated last year
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated 10 months ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- [NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…☆39Feb 14, 2026Updated 2 weeks ago