xsankar / AI-Red-TeamingLinks
All things specific to LLM Red Teaming Generative AI
☆29Updated last year
Alternatives and similar repositories for AI-Red-Teaming
Users that are interested in AI-Red-Teaming are comparing it to the libraries listed below
Sorting:
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆150Updated last year
- Payloads for Attacking Large Language Models☆114Updated 6 months ago
- source code for the offsecml framework☆45Updated last year
- using ML models for red teaming☆45Updated 2 years ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆116Updated last year
- ☆98Updated 4 months ago
- ☆68Updated last week
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆90Updated last week
- Data Scientists Go To Jupyter☆68Updated 9 months ago
- ☆44Updated last year
- LLM | Security | Operations in one github repo with good links and pictures.☆71Updated last week
- General research for Dreadnode☆27Updated last year
- Autonomous Assumed Breach Penetration-Testing Active Directory Networks☆31Updated last month
- Awesome products for securing AI systems includes open source and commercial options and an infographic licensed CC-BY-SA-4.0.☆79Updated last year
- https://arxiv.org/abs/2412.02776☆66Updated last year
- Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.☆166Updated 2 years ago
- ☆21Updated last year
- ☆50Updated last week
- ☆122Updated last week
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systems☆42Updated last week
- A LLM explicitly designed for getting hacked☆165Updated 2 years ago
- [IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the vict…☆43Updated 10 months ago
- Reference notes for Attacking and Defending Generative AI presentation☆67Updated last year
- LLM Testing Findings Templates☆75Updated last year
- An experimental project exploring the use of Large Language Models (LLMs) to solve HackTheBox machines autonomously.☆185Updated 2 weeks ago
- A curated list of awesome AI Red Teaming resources and tools.☆28Updated 2 years ago
- CALDERA plugin for adversary emulation of AI-enabled systems☆105Updated 2 years ago
- Prototype of Full Agentic Application Security Testing, FAAST = SAST + DAST + LLM agents☆67Updated 7 months ago
- An interactive CLI application for interacting with authenticated Jupyter instances.☆54Updated 7 months ago
- Indirect Prompt Injection Methodology (IPIM) - A structured process which security professionals can use to find Indirect Prompt Injectio…☆14Updated 4 months ago