ellydee / acceptance-bench
A robust LLM evaluation framework measuring acceptance vs refusal across difficulty levels. Features multi-prompt variation testing, temperature sweeping, and LLM-as-judge evaluation. Current focus: creative writing benchmarks including erotica generation tasks.
85 · Oct 16, 2025 · Updated 4 months ago

Alternatives and similar repositories for acceptance-bench

Users who are interested in acceptance-bench compare it to the libraries listed below.
