rucnyz/LeakAgent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rucnyz/LeakAgent)

rucnyz / LeakAgent

☆29

Alternatives and similar repositories for LeakAgent

Users that are interested in LeakAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pasquini-dario / LLM_NeuralExec
View on GitHub
Code to generate NeuralExecs (prompt injection for LLMs)
☆27Oct 5, 2025Updated 9 months ago
sherdencooper / PromptFuzz
View on GitHub
☆30Oct 23, 2024Updated last year
compsec-snu / pfi
View on GitHub
PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents
☆31Mar 26, 2025Updated last year
ethz-spylab / unlearning-vs-safety
View on GitHub
☆27Oct 6, 2024Updated last year
sunblaze-ucb / progent
View on GitHub
Progent: Securing AI Agents with Privilege Control
☆42May 14, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fzwark / Secure_LLM_System
View on GitHub
☆16Mar 9, 2025Updated last year
LetterLiGo / Agent-WebCloak
View on GitHub
[IEEE S&P'26] WebCloak: Characterizing and Mitigating the Threats of LLM-Driven Web Agents as Intelligent Scrapers
☆28Jan 31, 2026Updated 5 months ago
AI-secure / AdvAgent
View on GitHub
☆25May 28, 2025Updated last year
XingTuLab / Cache_Me_Catch_You
View on GitHub
Cache Me, Catch You: Cache Related Security Threats in LLM Serving Frameworks (NDSS 2026)
☆18Dec 18, 2025Updated 7 months ago
RPC2 / AutoInject
View on GitHub
☆20Jun 12, 2026Updated last month
swj0419 / muse_bench
View on GitHub
☆34Mar 13, 2025Updated last year
Confirm-Solutions / flrt
View on GitHub
Fluent student-teacher redteaming
☆23Jul 25, 2024Updated 2 years ago
amazon-science / controlling-llm-memorization
View on GitHub
☆38May 19, 2023Updated 3 years ago
Greysahy / ipiguard
View on GitHub
[EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
☆22Sep 16, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AI4Good24 / PsySafe
View on GitHub
☆53Feb 8, 2025Updated last year
alkaet / LobotoMl
View on GitHub
LobotoMl is a set of scripts and tools to assess production deployments of ML services
☆10May 16, 2022Updated 4 years ago
ZJU-SEC / TensorAbuse
View on GitHub
TensorFlow API analysis tool and malicious model detection tool
☆41May 27, 2025Updated last year
invariantlabs-ai / invariant-gateway
View on GitHub
LLM proxy to observe and debug what your AI agents are doing.
☆78Nov 6, 2025Updated 8 months ago
locuslab / acr-memorization
View on GitHub
☆41Dec 19, 2024Updated last year
T1aNS1R / Evil-Geniuses
View on GitHub
☆71Feb 4, 2024Updated 2 years ago
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
locuslab / breaking-poisoned-classifier
View on GitHub
Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"
☆26Jan 7, 2022Updated 4 years ago
Gwinhen / DRUPE
View on GitHub
Distribution Preserving Backdoor Attack in Self-supervised Learning
☆20Jan 27, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HKUST-KnowComp / LLM-Multistep-Jailbreak
View on GitHub
Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT
☆37Oct 15, 2023Updated 2 years ago
ZZZhr-1 / Robust_GUI_Grounding
View on GitHub
On the Robustness of GUI Grounding Models Against Image Attacks
☆12Apr 8, 2025Updated last year
thu-coai / Targeted-Data-Extraction
View on GitHub
Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…
☆24May 8, 2023Updated 3 years ago
SolidShen / RIPPLE_official
View on GitHub
☆20Feb 11, 2024Updated 2 years ago
ml-postech / selective-generation
View on GitHub
☆11Dec 8, 2024Updated last year
microsoft / TaskTracker
View on GitHub
TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…
☆92Sep 1, 2025Updated 10 months ago
eth-sri / llmprivacy
View on GitHub
☆75Feb 16, 2025Updated last year
Yuuoniy / SpecAuditor
View on GitHub
SpecAuditor: Generating Audit Specifications for LLM-Driven Bug Detection (S&P 2026)
☆23Jun 8, 2026Updated last month
google-research / lm-extraction-benchmark
View on GitHub
☆307Jun 10, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ChenWu98 / agent-attack
View on GitHub
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
☆140Feb 19, 2025Updated last year
PKU-ASAL / NoDrop
View on GitHub
☆19Jul 9, 2026Updated 2 weeks ago
yupeijei1997 / WildToolBench
View on GitHub
(ICLR 2026)Benchmarking LLM Tool-Use in the Wild
☆37Apr 5, 2026Updated 3 months ago
ethz-spylab / misleading-privacy-evals
View on GitHub
Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)
☆13Apr 29, 2024Updated 2 years ago
WUSTL-CSPL / LLMJailbreak
View on GitHub
☆36Sep 30, 2024Updated last year
zrporz / THU-CST-Undergraduate
View on GitHub
本人在贵系所学课程的作业和复习资料参考
☆22Jun 22, 2025Updated last year
David-Li0406 / AI-Supervision-Risk
View on GitHub
☆21Mar 17, 2025Updated last year