idllresearch / malicious-gpt

[USENIX Security '24] Dataset associated with real-world malicious LLM applications, including 45 malicious prompts for generating malicious content, malicious responses from LLMs, 182 real-world jailbreak prompts, keywords related to LLMs, etc.
53Updated last month

Related projects

Alternatives and complementary repositories for malicious-gpt