kuk / simple-evals-ru
This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA
☆23Updated 2 weeks ago
Alternatives and similar repositories for simple-evals-ru:
Users who are interested in simple-evals-ru are comparing it to the libraries listed below.
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 6 months ago
- Augmentex — a library for augmenting texts with errors☆63Updated 9 months ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 2 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆151Updated 3 months ago
- Russian Corpus of Linguistic Acceptability☆43Updated 6 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆25Updated 2 weeks ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆40Updated 3 weeks ago
- ☆18Updated 3 years ago
- Efficient DL/ML Models Seminars☆29Updated 3 months ago
- Effective LLM Alignment Toolkit☆125Updated this week
- ☆27Updated this week
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Updated last year
- A benchmark comparing Russian analogues of ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated last year
- ☆12Updated last year
- Natural language processing for 3rd- and 4th-year students of the HSE University School of Linguistics.☆14Updated last year
- "Ruformers": a list of popular transformer-based base models for Russian natural language processing tasks☆36Updated last year
- ☆57Updated last year
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated 2 weeks ago
- Question answering in Russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- This project covers my participation in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE☆12Updated last year
- BSNLP 2021☆33Updated 5 months ago
- A simple text normalizer for use before speech synthesis☆31Updated 11 months ago
- Deep Learning for Speech☆90Updated 3 months ago
- A Python wrapper for the RuWordNet thesaurus☆62Updated 4 months ago
- MMLU eval for RU/EN☆15Updated last year
- Repository for the course "Practical Aspects of Training Large Language Models", Faculty of Computational Mathematics and Cybernetics (VMK), Moscow State University, fall 2024☆14Updated 3 months ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆47Updated 2 weeks ago
- ☆10Updated last year
- ☆26Updated last week
- A library for extracting statistics from Russian-language texts.☆120Updated 2 years ago