jayminban / 41-llms-evaluated-on-19-benchmarks

This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.
72 · Updated 3 weeks ago

Alternatives and similar repositories for 41-llms-evaluated-on-19-benchmarks

Users who are interested in 41-llms-evaluated-on-19-benchmarks are comparing it to the libraries listed below.
