sshh12 / llm_backdoor
Experimental tools to backdoor large language models by rewriting their system prompts at the raw parameter level. This potentially enables offline remote code execution without running any actual code on the victim's machine, or lets an attacker thwart LLM-based fraud/moderation systems.
☆200 · Updated 2 months ago
Alternatives and similar repositories for llm_backdoor
Users that are interested in llm_backdoor are comparing it to the libraries listed below
- Code snippets to reproduce MCP tool poisoning attacks. ☆188 · Updated 8 months ago
- Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts Within GenAI Ecosystems ☆222 · Updated 3 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs) ☆152 · Updated last year
- Lightweight LLM Interaction Framework ☆400 · Updated last week
- AI agent for autonomous cyber operations ☆451 · Updated last month
- Repo with random useful scripts, utilities, prompts and stuff ☆193 · Updated this week
- This repository contains various attacks against Large Language Models. ☆122 · Updated last year
- Use LLMs for document ranking ☆160 · Updated 8 months ago
- Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks ☆92 · Updated 7 months ago
- ☆271 · Updated 2 weeks ago
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework. ☆52 · Updated last year
- ☆124 · Updated last week
- LLM | Security | Operations in one github repo with good links and pictures. ☆81 · Updated last week
- Raptor turns Claude Code into a general-purpose AI offensive/defensive security agent. By using Claude.md and creating rules, sub-agents,… ☆897 · Updated last week
- Using Agents To Automate Pentesting ☆344 · Updated 11 months ago
- We present MAPTA, a multi-agent system for autonomous web application security assessment that combines large language model orchestratio… ☆86 · Updated 4 months ago
- Codebase of https://arxiv.org/abs/2410.14923 ☆52 · Updated last year
- MCPSafetyScanner - Automated MCP safety auditing and remediation using Agents. More info: https://www.arxiv.org/abs/2504.03767 ☆159 · Updated 8 months ago
- A YAML based format for describing tools to LLMs, like man pages but for robots! ☆82 · Updated 7 months ago
- https://arxiv.org/abs/2412.02776 ☆67 · Updated last year
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systems ☆44 · Updated 3 weeks ago
- Code for the paper "Defeating Prompt Injections by Design" ☆187 · Updated 6 months ago
- What does gpt-oss tell us about OpenAI's training data? ☆33 · Updated 3 months ago
- A Model Context Protocol (MCP) server for querying the VirusTotal API. ☆95 · Updated 9 months ago
- Red-Teaming Language Models with DSPy ☆248 · Updated 10 months ago
- ☆51 · Updated last week
- A sandbox environment designed for loading, running and profiling a wide range of files, including machine learning models, ELFs, Pickle,… ☆338 · Updated last week
- DeepTeam is a framework to red team LLMs and LLM systems. ☆1,206 · Updated this week
- A knowledge source about TTPs used to target GenAI-based systems, copilots and agents ☆131 · Updated last week
- NeuroSploitv2 is an advanced, AI-powered penetration testing framework designed to automate and augment various aspects of offensive secu… ☆211 · Updated last week