usail-hkust / Jailjudge
JAILJUDGE: A comprehensive evaluation benchmark covering a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios), along with high-quality human-annotated test datasets.
59 · Dec 13, 2024 · Updated last year

Alternatives and similar repositories for Jailjudge

Users who are interested in Jailjudge are comparing it to the libraries listed below.
