kuk / simple-evals-ru
This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, and GPQA.
23 | Apr 16, 2025 | Updated 10 months ago

Alternatives and similar repositories for simple-evals-ru

Users who are interested in simple-evals-ru are comparing it to the libraries listed below.
