yjw1029 / Self-Reminder

Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.
43Updated last year

Related projects

Alternatives and complementary repositories for Self-Reminder