thu-coai / JailbreakDefense_GoalPriority

[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
13Updated 4 months ago

Related projects

Alternatives and complementary repositories for JailbreakDefense_GoalPriority