WUSTL-CSPL/LLMJailbreak

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WUSTL-CSPL/LLMJailbreak)

WUSTL-CSPL / LLMJailbreak

☆36

Alternatives and similar repositories for LLMJailbreak

Users that are interested in LLMJailbreak are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SaFo-Lab / JailBreakV_28K
View on GitHub
[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…
☆96May 9, 2025Updated last year
CryptoAILab / misalignment
View on GitHub
[NDSS'25] The official implementation of safety misalignment.
☆19Jan 8, 2025Updated last year
thu-coai / JailbreakDefense_GoalPriority
View on GitHub
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29Jul 9, 2024Updated 2 years ago
zh1yu4nyu / CodeIPPrompt
View on GitHub
https://icml.cc/virtual/2023/poster/24354
☆10Aug 15, 2023Updated 2 years ago
IBM / URET
View on GitHub
Universal Robustness Evaluation Toolkit (for Evasion)
☆32Sep 17, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EvanXiaa / Awesome-LLM_For-SE-Sec-Papers
View on GitHub
☆32Sep 22, 2024Updated last year
ZhangZhuoSJTU / LINT
View on GitHub
☆17Sep 4, 2024Updated last year
verazuo / prompt-stealing-attack
View on GitHub
[USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models
☆53Jan 11, 2025Updated last year
zhipeng-wei / EmojiAttack
View on GitHub
Emoji Attack [ICML 2025]
☆46Jul 15, 2025Updated last year
fzwark / Secure_LLM_System
View on GitHub
☆17Mar 9, 2025Updated last year
kriti-hippo / red_queen
View on GitHub
Red Queen Dataset and data generation template
☆27Dec 26, 2025Updated 7 months ago
whdii / TMM
View on GitHub
☆21Jan 15, 2024Updated 2 years ago
Testing4AI / RobOT
View on GitHub
Code release for RobOT (ICSE'21)
☆15Dec 5, 2022Updated 3 years ago
rarefin / TTS_VAE
View on GitHub
Text to Speech Synthesis based on controllable latent representation
☆14Aug 30, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zichuan-liu / IB4LLMs
View on GitHub
[NeurIPS'24] Protecting Your LLMs with Information Bottleneck
☆25Nov 7, 2024Updated last year
PurCL / ProSec
View on GitHub
Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"
☆18Feb 26, 2026Updated 5 months ago
SheltonLiu-N / AutoDAN
View on GitHub
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…
☆453Jan 22, 2025Updated last year
SaFo-Lab / AutoDAN-Turbo
View on GitHub
[ICLR 2025 Spotlight] The official implementation of our ICLR2025 paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to…
☆383Oct 8, 2025Updated 9 months ago
NY1024 / RACE
View on GitHub
☆27Mar 17, 2025Updated last year
Aatrox103 / SAP
View on GitHub
☆49May 9, 2024Updated 2 years ago
AI45Lab / ActorAttack
View on GitHub
☆134Jun 29, 2026Updated last month
pasquini-dario / LLM_NeuralExec
View on GitHub
Code to generate NeuralExecs (prompt injection for LLMs)
☆27Oct 5, 2025Updated 9 months ago
serendipity1122 / Pre-trained-Model-Guided-Fine-Tuning-for-Zero-Shot-Adversarial-Robustness
View on GitHub
Code repository for CVPR2024 paper 《Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness》
☆25May 29, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sherdencooper / GPTFuzz
View on GitHub
Official repo for GPTFUZZER : Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
☆601Feb 27, 2026Updated 5 months ago
rucnyz / LeakAgent
View on GitHub
☆29Aug 31, 2025Updated 10 months ago
EasyJailbreak / EasyJailbreak
View on GitHub
An easy-to-use Python framework to generate adversarial jailbreak prompts.
☆876Mar 30, 2026Updated 3 months ago
XingTuLab / Cache_Me_Catch_You
View on GitHub
Cache Me, Catch You: Cache Related Security Threats in LLM Serving Frameworks (NDSS 2026)
☆18Dec 18, 2025Updated 7 months ago
OSU-NLP-Group / AmpleGCG
View on GitHub
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
☆87Nov 3, 2024Updated last year
chujiezheng / LLM-Safeguard
View on GitHub
Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"
☆108May 20, 2025Updated last year
AI-secure / AdvAgent
View on GitHub
☆25May 28, 2025Updated last year
pasks87 / cybersecurity_automotive_can
View on GitHub
A python plugin integrated into Carla Simulator for emulate cyber-attack over the CAN-Bus (https://carla.org/)
☆15Apr 26, 2021Updated 5 years ago
xitongpu / PSPNet
View on GitHub
A implementation of PSPNet with MindSpore
☆11Dec 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chawins / llm-sp
View on GitHub
Papers and resources related to the security and privacy of LLMs 🤖
☆579Jun 8, 2025Updated last year
zhangrui4041 / Instruction_Backdoor_Attack
View on GitHub
☆25Aug 21, 2024Updated last year
ChenWu98 / agent-attack
View on GitHub
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
☆140Feb 19, 2025Updated last year
ShiJiawenwen / JudgeDeceiver
View on GitHub
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
☆41Sep 17, 2025Updated 10 months ago
PurduePAML / PICCOLO
View on GitHub
☆26Dec 1, 2022Updated 3 years ago
Tele-EVOL / TeleAI-Safety
View on GitHub
☆27Jan 5, 2026Updated 6 months ago
CryptoAPI-Bench / CryptoAPI-Bench
View on GitHub
☆14Jan 10, 2024Updated 2 years ago