controllability / jailbreak-evaluation

The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation.
19Updated 2 months ago

Alternatives and similar repositories for jailbreak-evaluation:

Users that are interested in jailbreak-evaluation are comparing it to the libraries listed below