kuk / simple-evals-ru

This repository measures the quality of YandexGPT, GigaChat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA.
23 · Updated last week

Alternatives and similar repositories for simple-evals-ru:

Users who are interested in simple-evals-ru are comparing it to the libraries listed below.