usail-hkust / Jailjudge

JAILJUDGE: A comprehensive evaluation benchmark covering a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios), along with high-quality human-annotated test datasets.
47 · Updated 6 months ago

Alternatives and similar repositories for Jailjudge

Users who are interested in Jailjudge are comparing it to the libraries listed below.
