controllability / jailbreak-evaluation

The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation.
22Updated 4 months ago

Alternatives and similar repositories for jailbreak-evaluation:

Users that are interested in jailbreak-evaluation are comparing it to the libraries listed below