rucnyz / LeakAgentView external linksLinks
☆28Aug 31, 2025Updated 5 months ago
Alternatives and similar repositories for LeakAgent
Users that are interested in LeakAgent are comparing it to the libraries listed below
Sorting:
- Code to generate NeuralExecs (prompt injection for LLMs)☆27Oct 5, 2025Updated 4 months ago
- ☆13Mar 9, 2025Updated 11 months ago
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- ☆27Sep 11, 2025Updated 5 months ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆79Sep 1, 2025Updated 5 months ago
- Distribution Preserving Backdoor Attack in Self-supervised Learning☆20Jan 27, 2024Updated 2 years ago
- ☆33Mar 13, 2025Updated 11 months ago
- ☆20Feb 11, 2024Updated 2 years ago
- ☆52Feb 8, 2025Updated last year
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 2 years ago
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models☆25Mar 15, 2025Updated 10 months ago
- ☆21Mar 17, 2025Updated 10 months ago
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- ☆70Feb 4, 2024Updated 2 years ago
- ☆70Feb 16, 2025Updated 11 months ago
- Code for "Adversarial Illusions in Multi-Modal Embeddings"☆31Aug 4, 2024Updated last year
- Auditing agents for fine-tuning safety☆18Oct 21, 2025Updated 3 months ago
- Agent Security Bench (ASB)☆182Oct 27, 2025Updated 3 months ago
- ☆31Jul 14, 2023Updated 2 years ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆431Feb 3, 2026Updated last week
- Code for the paper "Defeating Prompt Injections by Design"☆246Jun 20, 2025Updated 7 months ago
- ☆37Sep 30, 2024Updated last year
- ☆37Dec 19, 2024Updated last year
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)☆95Jan 20, 2025Updated last year
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆35Oct 15, 2023Updated 2 years ago
- Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks (IEEE S&P 2024)☆34Jun 29, 2025Updated 7 months ago
- [CCS 2021] "DataLens: Scalable Privacy Preserving Training via Gradient Compression and Aggregation" by Boxin Wang*, Fan Wu*, Yunhui Long…☆36Dec 28, 2021Updated 4 years ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆50Dec 23, 2024Updated last year
- [NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…☆35Feb 4, 2026Updated last week
- [ICCV 2023] "TRM-UAP: Enhancing the Transferability of Data-Free Universal Adversarial Perturbation via Truncated Ratio Maximization", Yi…☆12Jul 17, 2024Updated last year
- Official frontend web application for Moltbook - The Social Network for AI Agents. Built with Next.js 14, TypeScript, Tailwind CSS featur…☆25Feb 1, 2026Updated last week
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated 10 months ago
- Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]☆109Sep 27, 2024Updated last year
- Whispers in the Machine: Confidentiality in Agentic Systems☆41Dec 11, 2025Updated 2 months ago
- A curated list of academic events on AI Security & Privacy☆167Aug 22, 2024Updated last year
- TYPO3 Extension ⇢ Integration of sendinblue as finisher of the form extension☆11Jan 23, 2025Updated last year
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- Pre-trained Online Contrastive Learning for Insurance Fraud Detection☆12Jul 12, 2024Updated last year
- ☆12Apr 3, 2020Updated 5 years ago