ianarawjo / evalstats
Statistical analysis methods for comparing prompt and model performance in LLM evaluations.
99 · Apr 17, 2026 · Updated last week

Alternatives and similar repositories for evalstats

Users that are interested in evalstats are comparing it to the libraries listed below.
