☆29Aug 31, 2025Updated 9 months ago
Alternatives and similar repositories for LeakAgent
Users that are interested in LeakAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to generate NeuralExecs (prompt injection for LLMs)☆27Oct 5, 2025Updated 8 months ago
- ☆31Oct 23, 2024Updated last year
- ☆27Oct 6, 2024Updated last year
- ☆15Mar 9, 2025Updated last year
- [ACL 2024] Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications☆18Apr 9, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆25May 28, 2025Updated last year
- Progent: Securing AI Agents with Privilege Control☆38May 14, 2026Updated last month
- MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP cl…☆34Sep 1, 2025Updated 9 months ago
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆89Sep 1, 2025Updated 9 months ago
- Distribution Preserving Backdoor Attack in Self-supervised Learning☆20Jan 27, 2024Updated 2 years ago
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆137Feb 19, 2025Updated last year
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- ☆34Mar 13, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆40May 19, 2023Updated 3 years ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆41Aug 4, 2025Updated 10 months ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10May 16, 2022Updated 4 years ago
- ☆53Feb 8, 2025Updated last year
- ☆71Feb 4, 2024Updated 2 years ago
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated last year
- Artifact evaluation of MobiSys25 SynCheck☆20Mar 24, 2025Updated last year
- ☆40Dec 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆37Oct 15, 2023Updated 2 years ago
- Agent Security Bench (ASB)☆260Apr 16, 2026Updated last month
- Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"☆26Jan 7, 2022Updated 4 years ago
- ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models☆26Sep 27, 2025Updated 8 months ago
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models☆29Mar 15, 2025Updated last year
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)☆105Jan 20, 2025Updated last year
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- use angr to deobfuscation☆10Oct 8, 2019Updated 6 years ago
- ☆20Feb 11, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆24May 8, 2023Updated 3 years ago
- ☆73Feb 16, 2025Updated last year
- Code for the paper "Defeating Prompt Injections by Design"☆341Jun 20, 2025Updated 11 months ago
- ☆306Updated this week
- VulnGym: A Real-World, Project-Level Vulnerability Benchmark for White-Box Vulnerability-Hunting Agents☆159Jun 2, 2026Updated last week
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆613Jun 2, 2026Updated last week
- This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…☆19Jun 7, 2023Updated 3 years ago