kuk / simple-evals-ru
The repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA
☆23 · Updated 2 months ago
Alternatives and similar repositories for simple-evals-ru
Users who are interested in simple-evals-ru are comparing it to the libraries listed below
- Augmentex: a library for augmenting texts with errors ☆65 · Updated 11 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆61 · Updated 8 months ago
- ☆13 · Updated last year
- Russian Corpus of Linguistic Acceptability ☆44 · Updated 8 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode… ☆26 · Updated 2 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆42 · Updated 3 months ago
- ☆32 · Updated 2 months ago
- ☆57 · Updated last year
- Natural language processing for 3rd- and 4th-year students of the HSE University School of Linguistics. ☆14 · Updated last year
- Efficient DL/ML Models Seminars ☆31 · Updated 5 months ago
- Effective LLM Alignment Toolkit ☆132 · Updated last month
- Pipeline for easy fine-tuning of BERT architecture for sequence classification ☆23 · Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆155 · Updated 6 months ago
- ☆18 · Updated 4 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆17 · Updated 4 months ago
- Gazeta: Dataset for automatic summarization of Russian news ☆35 · Updated 3 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆21 · Updated 2 months ago
- Question answering in Russian with XLMRobertaLarge as a service ☆21 · Updated 3 years ago
- A benchmark comparing Russian counterparts of ChatGPT: Saiga, YandexGPT, Gigachat ☆61 · Updated last year
- Automatic hyperparameter tuning for topic models (ARTM approach) using evolutionary algorithms ☆27 · Updated 5 months ago
- Repository for the course "Practical Aspects of Training Large Language Models", Faculty of Computational Mathematics and Cybernetics (CMC), Moscow State University, fall 2024 ☆16 · Updated 6 months ago
- This project covers my participation in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE ☆12 · Updated last year
- Deep Learning for Speech ☆93 · Updated 5 months ago
- Probing suite for evaluation of Russian embedding and language models ☆33 · Updated 8 months ago
- Tools and agents for automated research. ☆30 · Updated 2 weeks ago
- ☆10 · Updated last year
- https://arxiv.org/abs/2201.06499 ☆29 · Updated last year
- A simple text normalizer for use before speech synthesis ☆33 · Updated last year
- Russian dialog datasets parsers and crawlers. ☆16 · Updated 3 years ago
- MMLU eval for RU/EN ☆15 · Updated last year