SORRY-Bench / sorry-bench
Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)
76 · Mar 1, 2025 · Updated last year

Alternatives and similar repositories for sorry-bench

Users who are interested in sorry-bench compare it to the libraries listed below.
