kuk / simple-evals-ru
This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA
☆23Updated 3 weeks ago
Alternatives and similar repositories for simple-evals-ru:
Users that are interested in simple-evals-ru are comparing it to the libraries listed below
- Augmentex — a library for augmenting texts with errors☆63Updated 10 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 7 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆25Updated last month
- Russian Corpus of Linguistic Acceptability☆43Updated 7 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆42Updated last month
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆151Updated 4 months ago
- Effective LLM Alignment Toolkit☆128Updated 3 weeks ago
- ☆27Updated 3 weeks ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 2 months ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Updated last year
- MMLU eval for RU/EN☆15Updated last year
- ☆57Updated last year
- ☆12Updated last year
- Repository for the course "Practical Aspects of Training Large Language Models", CMC MSU, fall 2024☆15Updated 4 months ago
- Deep Learning for Speech☆92Updated 4 months ago
- Question answering in Russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- ☆10Updated last year
- Automatic hyperparameter tuning for topic models (ARTM approach) using evolutionary algorithms☆25Updated 3 months ago
- ☆18Updated 3 years ago
- Automatic Speech Recognition in Python using ONNX models☆15Updated last week
- A simple text normalizer for use before speech synthesis☆32Updated 11 months ago
- A benchmark comparing Russian analogues of ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated last year
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated last month
- Gazeta: Dataset for automatic summarization of Russian news☆35Updated 3 years ago
- Efficient DL/ML Models Seminars☆30Updated 4 months ago
- Top ML papers of the week.☆31Updated this week
- https://arxiv.org/abs/2201.06499☆29Updated last year
- Probing suite for evaluation of Russian embedding and language models☆33Updated 7 months ago
- BSNLP 2021☆33Updated 6 months ago
- Russian dialog datasets parsers and crawlers.☆16Updated 3 years ago