ellydee / acceptance-bench

A robust LLM evaluation framework measuring acceptance vs refusal across difficulty levels. Features multi-prompt variation testing, temperature sweeping, and LLM-as-judge evaluation. Current focus: creative writing benchmarks including erotica generation tasks.
57 · Updated this week

Alternatives and similar repositories for acceptance-bench

Users who are interested in acceptance-bench are comparing it to the libraries listed below.
