centerforaisafety / HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
569Updated 6 months ago

Alternatives and similar repositories for HarmBench:

Users that are interested in HarmBench are comparing it to the libraries listed below