kuk / simple-evals-ruLinks

Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA
23Updated last month

Alternatives and similar repositories for simple-evals-ru

Users that are interested in simple-evals-ru are comparing it to the libraries listed below

Sorting: