Aleph-Alpha-Research / eval-frameworkView on GitHub
Comprehensive LLM evaluation at scale: A production-ready framework for evaluating large language models across multiple benchmarks.
36Updated this week

Alternatives and similar repositories for eval-framework

Users that are interested in eval-framework are comparing it to the libraries listed below

Sorting:

Are these results useful?