aisa-group/skill-inject

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

☆88

Alternatives and similar repositories for skill-inject

Users who are interested in skill-inject are comparing it to the repositories listed below.

jiaxiaojunQAQ / SkillJect
SkillJect: Automating Stealthy Skill-Based Prompt Injection for Coding Agents with Trace-Driven Closed-Loop Refinement
☆73 · Jun 11, 2026 · Updated last month
protectskills / MaliciousAgentSkillsBench
A Security Benchmark for Claude Code Agent Skills
☆69 · Jul 8, 2026 · Updated last week
Zhow01 / SkillAttack
☆52 · May 19, 2026 · Updated 2 months ago
aisa-group / promptinject-agent-skills
Agent Skills Enable a New Class of Realistic and Trivially Simple Prompt Injections
☆21 · Jul 2, 2026 · Updated 2 weeks ago
TrustAIRLab / HarmfulSkillBench
The Official Repository for Paper "HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?"
☆15 · May 2, 2026 · Updated 2 months ago
TanqiuJiang / AgentLAB
The official implementation of the paper "AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks"
☆26 · Jun 1, 2026 · Updated last month
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆37 · Jun 1, 2025 · Updated last year
RPC2 / AutoInject
☆20 · Jun 12, 2026 · Updated last month
lxyeternal / MalSkillBench
A benchmark and generation framework for malicious agent skills.
☆39 · Jun 10, 2026 · Updated last month
AI45Lab / skill-safety-bench
☆28 · May 14, 2026 · Updated 2 months ago
facebookresearch / prompt-siren
A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities a…
☆54 · Updated this week
MurrayTom / ToolSafe
Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…
☆70 · Mar 25, 2026 · Updated 3 months ago
qualixar / skillfortify
First formal security scanner for AI agent skills & plugins. Static analysis, supply chain verification, SBOM generation. 22 frameworks s…
☆26 · May 25, 2026 · Updated last month
Open-Agent-Safety / OpenAgentSafety
Evaluating Agent Safety in Realistic, High-Risk Simulations
☆31 · Jul 6, 2026 · Updated 2 weeks ago
AI-secure / UDora
[ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents
☆37 · Jun 24, 2025 · Updated last year
eth-sri / privacy-inference-multimodal
☆21 · Feb 3, 2025 · Updated last year
ethz-spylab / agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆668 · Jun 2, 2026 · Updated last month
ShiJiawenwen / JudgeDeceiver
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
☆41 · Sep 17, 2025 · Updated 10 months ago
agiresearch / ASB
Agent Security Bench (ASB)
☆270 · Apr 16, 2026 · Updated 3 months ago
CHATS-lab / ToolShield
[ICML 2026] Official implementation for paper "Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Ag…
☆28 · Jul 6, 2026 · Updated 2 weeks ago
m4p1e / agent-sentinel
AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents
☆35 · Aug 31, 2025 · Updated 10 months ago
AstorYH / PASB
An end-to-end security evaluation framework tailored for real-world personalized agent.
☆15 · Feb 28, 2026 · Updated 4 months ago
DPamK / BadAgent
☆33 · Feb 27, 2025 · Updated last year
liuchen11 / AdversaryLossLandscape
On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]
☆36 · Jul 3, 2021 · Updated 5 years ago
OSU-NLP-Group / RedTeamCUA
[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments
☆56 · Feb 9, 2026 · Updated 5 months ago
bigglesworthnotacat / LLM-Steg
[ICLR 2026 Oral] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
☆20 · Mar 22, 2026 · Updated 3 months ago
S2yyyy / OpenClaw-Analysis
☆31 · Mar 11, 2026 · Updated 4 months ago
GraySwanAI / ipi_arena_os
☆42 · Mar 18, 2026 · Updated 4 months ago
LLM-QC / judgezoo
A collection of judges for evaluating LLM model output for safety & toxicity with a standardized API.
☆15 · Jan 7, 2026 · Updated 6 months ago
albert-y1n / PISmith
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
☆21 · Updated this week
SaFo-Lab / DRIFT
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…
☆58 · Updated this week
xjzzzzzzzz / MCPSafety
☆22 · Dec 18, 2025 · Updated 7 months ago
facebookresearch / rl-injector
Official release of code for the paper "RL is a hammer and LLMs are nails: A simple RL approach to stronger prompt injection attacks"
☆53 · May 6, 2026 · Updated 2 months ago
qizhangli / MoreBayesian-attack
Code for our ICLR 2023 paper Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples.
☆18 · May 31, 2023 · Updated 3 years ago
skilltester-ai / skilltester
☆32 · Jul 1, 2026 · Updated 2 weeks ago
Astarojth / AgentAuditor-ASSEBench
☆39 · May 29, 2026 · Updated last month
facebookresearch / jailbreak-objectives
Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"
☆37 · Jul 2, 2026 · Updated 2 weeks ago
dongsenzhang / MSB
☆38 · Mar 24, 2026 · Updated 3 months ago
uiuc-kang-lab / AdaptiveAttackAgent
☆38 · Mar 12, 2025 · Updated last year