theshi-1128 / ReDPJLinks
☆21Updated last month
Alternatives and similar repositories for ReDPJ
Users that are interested in ReDPJ are comparing it to the libraries listed below
Sorting:
- ☆89Updated last year
- [NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.☆172Updated 7 months ago
- The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Lang…☆141Updated 2 months ago
- ☆67Updated 10 months ago
- Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024)