agiresearch/ASB

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/agiresearch/ASB)

agiresearch / ASB

Agent Security Bench (ASB)

☆273

Alternatives and similar repositories for ASB

Users that are interested in ASB are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ethz-spylab / agentdojo
View on GitHub
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆678Jun 2, 2026Updated last month
uiuc-kang-lab / InjecAgent
View on GitHub
☆153Jul 2, 2024Updated 2 years ago
thu-coai / Agent-SafetyBench
View on GitHub
☆149Aug 11, 2025Updated 11 months ago
kaijiezhu11 / MELON
View on GitHub
[ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
☆36Jul 31, 2025Updated 11 months ago
lancopku / agent-backdoor-attacks
View on GitHub
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆116Sep 27, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AI-secure / AgentPoison
View on GitHub
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆230Jun 17, 2026Updated last month
xjzzzzzzzz / MCPSafety
View on GitHub
☆22Dec 18, 2025Updated 7 months ago
OSU-NLP-Group / AgentSafety
View on GitHub
☆192Oct 31, 2025Updated 8 months ago
whfeLingYu / DemonAgent
View on GitHub
☆18Apr 1, 2025Updated last year
Astarojth / AgentAuditor-ASSEBench
View on GitHub
☆39May 29, 2026Updated last month
SaFo-Lab / AgentDyn
View on GitHub
The official implementation of the paper "AgentDyn: Are Your Agent Security Defenses Deployable in Real-World Dynamic Environments?"
☆68May 19, 2026Updated 2 months ago
TanqiuJiang / AgentLAB
View on GitHub
The official implementation of the paper "AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks"
☆26Jun 1, 2026Updated last month
albert-y1n / PISmith
View on GitHub
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
☆22Jul 17, 2026Updated last week
MurrayTom / ToolSafe
View on GitHub
Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…
☆73Mar 25, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
guardagent / code
View on GitHub
☆47Dec 9, 2025Updated 7 months ago
sunblaze-ucb / progent
View on GitHub
Progent: Securing AI Agents with Privilege Control
☆41May 14, 2026Updated 2 months ago
compsec-snu / pfi
View on GitHub
PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents
☆31Mar 26, 2025Updated last year
sleeepeer / PoisonedRAG
View on GitHub
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆286Jan 27, 2026Updated 5 months ago
liu00222 / Open-Prompt-Injection
View on GitHub
This repository provides a benchmark for prompt injection attacks and defenses in LLMs
☆466Oct 29, 2025Updated 8 months ago
m4p1e / agent-sentinel
View on GitHub
AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents
☆35Aug 31, 2025Updated 10 months ago
facebookresearch / wasp
View on GitHub
Official implementation of the WASP web agent security benchmark
☆98Apr 13, 2026Updated 3 months ago
AI-secure / RedCode
View on GitHub
[NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents
☆86Apr 24, 2026Updated 3 months ago
Jacobhhy / Agent-Memory-Poisoning
View on GitHub
☆21Dec 11, 2025Updated 7 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
AI-secure / PolyGuard
View on GitHub
☆23Jun 18, 2025Updated last year
SaFo-Lab / DRIFT
View on GitHub
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…
☆58Jul 16, 2026Updated last week
AI45Lab / skill-safety-bench
View on GitHub
☆29May 14, 2026Updated 2 months ago
aisa-group / skill-inject
View on GitHub
Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks
☆88Jul 1, 2026Updated 3 weeks ago
dongsenzhang / MSB
View on GitHub
☆38Mar 24, 2026Updated 4 months ago
AI-secure / UDora
View on GitHub
[ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents
☆37Jun 24, 2025Updated last year
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
fzwark / Secure_LLM_System
View on GitHub
☆16Mar 9, 2025Updated last year
Zhang-Henry / INACTIVE
View on GitHub
The official implementation of CVPR 2025 paper "Invisible Backdoor Attack against Self-supervised Learning"
☆19Jul 5, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wangbo9719 / MEXTRA
View on GitHub
Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"
☆34Dec 2, 2025Updated 7 months ago
AI-secure / DecodingTrust-Agent
View on GitHub
☆69Jun 18, 2026Updated last month
AIS2Lab / MCPSecBench
View on GitHub
MCPSecBench: A Systematic Security Benchmark and Playground for Testing Model Context Protocols
☆35Mar 4, 2026Updated 4 months ago
pasquini-dario / LLM_NeuralExec
View on GitHub
Code to generate NeuralExecs (prompt injection for LLMs)
☆27Oct 5, 2025Updated 9 months ago
bigglesworthnotacat / LLM-Steg
View on GitHub
[ICLR 2026 Oral] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
☆20Mar 22, 2026Updated 4 months ago
RPC2 / AutoInject
View on GitHub
☆20Jun 12, 2026Updated last month
Open-Agent-Safety / OpenAgentSafety
View on GitHub
Evaluating Agent Safety in Realistic, High-Risk Simulations
☆31Jul 6, 2026Updated 2 weeks ago