NISPLab / JBShieldLinks

Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"
188Updated last month

Alternatives and similar repositories for JBShield

Users that are interested in JBShield are comparing it to the libraries listed below

Sorting: