user1342 / Awesome-LLM-Red-Teaming
A curated list of awesome LLM Red Teaming training, resources, and tools.
β16Updated last month
Alternatives and similar repositories for Awesome-LLM-Red-Teaming:
Users that are interested in Awesome-LLM-Red-Teaming are comparing it to the libraries listed below
- Professional Wargaming LLM Toolboxβ11Updated 6 months ago
- Prompt Injection Attacks against GPT-4, Gemini, Azure, Azure with Jailbreakβ21Updated 6 months ago
- π€π‘οΈπππ Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.β23Updated 11 months ago
- OllaDeck is a purple technology stack for Generative AI (text modality) cybersecurity. It provides a comprehensive set of tools for both β¦β17Updated 7 months ago
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.β46Updated 5 months ago
- [SPOILER ALERT] Solutions to Gandalf, the prompt hacking/red teaming game from Lakera AIβ18Updated last year
- Zero-trust AI APIs for easy and private consumption of open-source LLMsβ40Updated 9 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to poteβ¦β158Updated 3 weeks ago
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. β¦β47Updated last year
- Whispers in the Machine: Confidentiality in LLM-integrated Systemsβ36Updated last month
- Manual Prompt Injection / Red Teaming Toolβ27Updated 6 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).β109Updated last year
- Portal: GUI Tools for Agentsβ21Updated 3 weeks ago
- A reverse search tool for OSINT (Open Source Intelligence) gathering & facial recognition via Google Custom Search & Google Vision API's.β39Updated last year
- TLS & API keys for your LLM APIsβ16Updated 4 months ago
- A trial-and-error approach to temperature opimization for LLMs. Runs the same prompt at many temperatures and selects the best output autβ¦β56Updated last year
- A guide to LLM hacking: fundamentals, prompt injection, offense, and defenseβ148Updated 2 years ago
- Red-Teaming Language Models with DSPyβ183Updated 2 months ago
- This repository contains various attack against Large Language Models.β104Updated 11 months ago
- Penetration Testing AI Assistant based on open source LLMs.β70Updated 2 weeks ago
- A list of curated resources for people interested in AI Red Teaming, Jailbreaking, and Prompt Injectionβ101Updated last week
- A curated list of my GitHub stars!β19Updated this week
- β21Updated 11 months ago
- Risks and targets for assessing LLMs & LLM vulnerabilitiesβ30Updated 11 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the projectβ40Updated 4 months ago
- YAWNING TITAN is an abstract, graph based cyber-security simulation environment that supports the training of intelligent agents for autoβ¦β63Updated 11 months ago
- β20Updated last year
- LLM Optimize is a proof-of-concept library for doing LLM (large language model) guided blackbox optimization.β56Updated last year
- SourceGPT - prompt manager and source code analyzer built on top of ChatGPT as the oracleβ110Updated 2 years ago
- Learn about a type of vulnerability that specifically targets machine learning modelsβ260Updated 10 months ago