NISPLab / JBShield

Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"
38Updated last week

Alternatives and similar repositories for JBShield:

Users that are interested in JBShield are comparing it to the libraries listed below