NJUNLP / ReNeLLM

The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily".
74 · Updated 4 months ago

Related projects

Alternatives and complementary repositories for ReNeLLM