This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanEval, MBPP, BBH, MMLU-Pro, GPQA
☆23 Apr 16, 2025 Updated last year
Alternatives and similar repositories for simple-evals-ru
Users that are interested in simple-evals-ru are comparing it to the libraries listed below.
- ☆13 Jan 17, 2024 Updated 2 years ago
- Training and data processing code for Saiga ☆54 Jan 2, 2026 Updated 4 months ago
- Train punctuation and capitalization models for different languages ☆26 Apr 2, 2022 Updated 4 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆20 Feb 8, 2026 Updated 2 months ago
- Tools and agents for automated research. ☆53 Dec 5, 2025 Updated 4 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆63 Oct 7, 2024 Updated last year
- Effective LLM Alignment Toolkit ☆153 Jun 25, 2025 Updated 10 months ago
- ☆71 Aug 27, 2024 Updated last year
- Multilingual RAG benchmark. ☆10 Nov 22, 2024 Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆46 Mar 20, 2025 Updated last year
- Using transformers to generate Russian poetry ☆36 Aug 21, 2023 Updated 2 years ago
- A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe… ☆19 Mar 23, 2026 Updated last month
- NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities ☆30 Updated this week
- Repository of a data modeling and analysis tool based on Bayesian networks ☆133 Nov 10, 2025 Updated 5 months ago
- ☆29 Jan 13, 2026 Updated 3 months ago
- Framework for prototyping of LLM-based applications ☆25 Apr 16, 2026 Updated 2 weeks ago
- TinyTNAS is a hardware-aware, multi-objective, time-bound Neural Architecture Search (NAS) tool designed for TinyML time series classific… ☆22 Dec 11, 2024 Updated last year
- Links to Russian corpora + Python functions for loading and parsing ☆312 Apr 21, 2026 Updated last week
- ☆23 Aug 26, 2024 Updated last year
- Rule-based token, sentence segmentation for Russian language ☆281 Apr 13, 2026 Updated 3 weeks ago
- RuNNE ☆12 Jul 17, 2024 Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆166 Dec 8, 2025 Updated 4 months ago
- A comprehensive guide to machine learning (ML) and natural language processing (NLP). This project is intended for students of technical… ☆30 Aug 24, 2024 Updated last year
- Probability and statistics exams, Higher School of Economics ☆28 Jun 7, 2025 Updated 10 months ago
- A project for converting numbers written as text in Russian. ☆11 Apr 5, 2022 Updated 4 years ago
- MultiLabel classification of cow diseases by text and symptoms recognition (NER) ☆12 Aug 13, 2022 Updated 3 years ago
- Top ML papers of the week. ☆47 Updated this week
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production … ☆15 Apr 17, 2024 Updated 2 years ago
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf ☆11 Jul 25, 2023 Updated 2 years ago
- Lightweight library for creating services using just Python ☆11 Aug 1, 2023 Updated 2 years ago
- ☆25 Jun 12, 2023 Updated 2 years ago
- ☆22 Oct 30, 2024 Updated last year
- Language modeling and instruction tuning for Russian ☆462 Aug 20, 2024 Updated last year
- ☆29 Mar 4, 2026 Updated 2 months ago
- Short original books / study guides / instructions ☆26 Feb 14, 2025 Updated last year
- A graphing calculator written in C. ☆12 Oct 17, 2023 Updated 2 years ago
- ☆35 Apr 21, 2026 Updated last week
- Simulator for training and evaluation of Recommender Systems ☆57 Mar 24, 2025 Updated last year
- Simple projects that demonstrate kweb's capabilities 🦆 ☆13 Aug 2, 2021 Updated 4 years ago