kuk / simple-evals-ru
The repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA
23 | Apr 16, 2025 | Updated 11 months ago

Alternatives and similar repositories for simple-evals-ru

Users that are interested in simple-evals-ru are comparing it to the libraries listed below.
