NISPLab / JBShield

Code for the USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation".
181 · Updated last week

Alternatives and similar repositories for JBShield

Users who are interested in JBShield are comparing it to the libraries listed below.
