SORRY-Bench / sorry-benchLinks

Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)
54Updated 3 months ago

Alternatives and similar repositories for sorry-bench

Users that are interested in sorry-bench are comparing it to the libraries listed below

Sorting: