SORRY-Bench / sorry-bench

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
33Updated 4 months ago

Related projects

Alternatives and complementary repositories for sorry-bench