guardagent/code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/guardagent/code)

guardagent / code

☆47

Alternatives and similar repositories for code

Users that are interested in code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SaFo-Lab / AGrail4Agent
View on GitHub
[ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".
☆42Aug 4, 2025Updated 11 months ago
OSU-NLP-Group / EIA_against_webagent
View on GitHub
☆40Oct 2, 2024Updated last year
SaFo-Lab / DRIFT
View on GitHub
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…
☆58Updated this week
AI-secure / AdvAgent
View on GitHub
☆25May 28, 2025Updated last year
m4p1e / agent-sentinel
View on GitHub
AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents
☆35Aug 31, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
iliaishacked / sponge_examples
View on GitHub
☆34Oct 14, 2021Updated 4 years ago
agiresearch / ASB
View on GitHub
Agent Security Bench (ASB)
☆271Apr 16, 2026Updated 3 months ago
AI-secure / UDora
View on GitHub
[ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents
☆37Jun 24, 2025Updated last year
ChenWu98 / agent-attack
View on GitHub
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
☆139Feb 19, 2025Updated last year
ethz-spylab / agentdojo
View on GitHub
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆670Jun 2, 2026Updated last month
uiuc-kang-lab / InjecAgent
View on GitHub
☆152Jul 2, 2024Updated 2 years ago
TanqiuJiang / AgentLAB
View on GitHub
The official implementation of the paper "AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks"
☆26Jun 1, 2026Updated last month
luo-ziyuan / NeRF_Signature
View on GitHub
Source code of the paper "The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields".
☆18Mar 3, 2025Updated last year
SheltonLiu-N / Universal-Prompt-Injection
View on GitHub
The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".
☆73Oct 23, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CHATS-lab / ToolShield
View on GitHub
[ICML 2026] Official implementation for paper "Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Ag…
☆28Jul 6, 2026Updated 2 weeks ago
thu-coai / Agent-SafetyBench
View on GitHub
☆149Aug 11, 2025Updated 11 months ago
Astarojth / AgentAuditor-ASSEBench
View on GitHub
☆39May 29, 2026Updated last month
zhenxianglance / RE-paper
View on GitHub
Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing
☆15Feb 18, 2021Updated 5 years ago
Privatris / AgentLeak
View on GitHub
AgentLeak: Open benchmark for privacy leakage in LLM agents — 7 channels, multi-agent, multi-framework.
☆25Jul 1, 2026Updated 2 weeks ago
CryptoAILab / FigStep
View on GitHub
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆212Jun 26, 2025Updated last year
SchwinnL / LLM_Embedding_Attack
View on GitHub
Code to conduct an embedding attack on LLMs
☆33Jan 10, 2025Updated last year
lancopku / agent-backdoor-attacks
View on GitHub
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆115Sep 27, 2024Updated last year
AI-secure / AgentPoison
View on GitHub
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆230Jun 17, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HyeonjeongHa / MM-PoisonRAG
View on GitHub
Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"
☆16Dec 4, 2025Updated 7 months ago
haoyuwang99 / AgentSpec
View on GitHub
☆45Jan 15, 2026Updated 6 months ago
KuofengGao / Verbose_Images
View on GitHub
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆44Jan 25, 2024Updated 2 years ago
alphadl / SafeLLM_with_IntentionAnalysis
View on GitHub
Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting
☆21Mar 25, 2024Updated 2 years ago
SaFo-Lab / MetaAgent
View on GitHub
Offical Repository of MetaAgent Program
☆53Dec 2, 2025Updated 7 months ago
zhenxianglance / PCBA
View on GitHub
A Backdoor Attack against 3D Point Cloud Classifiers (ICCV2021)
☆18Oct 20, 2021Updated 4 years ago
AI-secure / RedCode
View on GitHub
[NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents
☆85Apr 24, 2026Updated 2 months ago
tml-epfl / os-harm
View on GitHub
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight]
☆69Sep 18, 2025Updated 10 months ago
Junfei-Z / PRISM
View on GitHub
[AAAI2026] Official repo for paper "PRISM: Privacy-Aware Routing for Adaptive Cloud–Edge LLM Inference via Semantic Sketch Collaboration"
☆23Updated this week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
leolee99 / PIGuard
View on GitHub
[ACL 2025] The official implementation of the paper "PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free".
☆79Dec 4, 2025Updated 7 months ago
OSU-NLP-Group / AgentSafety
View on GitHub
☆191Oct 31, 2025Updated 8 months ago
sleeepeer / PIArena
View on GitHub
[ACL 2026] PIArena: A Platform for Prompt Injection Evaluation
☆39Apr 28, 2026Updated 2 months ago
ucsb-mlsec / Awesome-Agent-Security
View on GitHub
☆59Jun 24, 2026Updated 3 weeks ago
dsh3n77 / MINJA
View on GitHub
Memory Injection Attacks on LLM Agents via Query-Only Interaction
☆29Feb 10, 2026Updated 5 months ago
facebookresearch / rl-injector
View on GitHub
Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks
☆53May 6, 2026Updated 2 months ago
wshi83 / EhrAgent
View on GitHub
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
☆137Dec 26, 2024Updated last year