aaFrostnova / PapillonLinks
[Usenix Security 2025] Official repo of paper PAPILLON: Efficient and Stealthy Fuzz Testing-Powered Jailbreaks for LLMs
☆15Updated 3 months ago
Alternatives and similar repositories for Papillon
Users that are interested in Papillon are comparing it to the libraries listed below
Sorting:
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆21Updated 5 months ago
- Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechan…☆14Updated last year
- Working Memory Attack on LLMs☆16Updated 3 months ago
- Code of paper: xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"☆13Updated 5 months ago
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆58Updated last year
- 1.0☆10Updated 2 months ago
- ☆61Updated 8 months ago
- [ICLR 2025] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks☆21Updated 4 months ago
- [ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`☆80Updated 2 weeks ago
- ☆23Updated last year
- This repository provides a benchmark for prompt Injection attacks and defenses☆267Updated last month
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Updated last year