The tiniest sentence encoder for Russian language
☆246Jul 25, 2024Updated last year
Alternatives and similar repositories for encodechka
Users that are interested in encodechka are comparing it to the libraries listed below
Sorting:
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆164Dec 8, 2025Updated 3 months ago
- Effective LLM Alignment Toolkit☆152Jun 25, 2025Updated 8 months ago
- Language modeling and instruction tuning for Russian☆465Aug 20, 2024Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- T5-based (russian) text normalization☆26Jan 25, 2024Updated 2 years ago
- Tools and agents for automated research.☆52Dec 5, 2025Updated 3 months ago
- ☆415Oct 23, 2024Updated last year
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Mar 16, 2023Updated 3 years ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- Rule-based token, sentence segmentation for Russian language☆279Jul 24, 2023Updated 2 years ago
- Library for industrial alignment.☆405Sep 24, 2025Updated 5 months ago
- ⚡ Набор решений для разработки LLM-приложений на русском языке с поддержкой GigaChat ⚡☆551Updated this week
- ☆28Jan 13, 2026Updated 2 months ago
- The project to find correlation between tweets and future stock prices☆12Feb 28, 2023Updated 3 years ago
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- ☆34Apr 14, 2025Updated 11 months ago
- Augmentex — a library for augmenting texts with errors☆69Jul 3, 2024Updated last year
- ☆22Jun 10, 2025Updated 9 months ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆35Jul 16, 2022Updated 3 years ago
- A new second practical assignment for Huawei's NLP course☆19Mar 20, 2024Updated 2 years ago
- Tools for shrinking fastText models (in gensim format)☆183May 3, 2024Updated last year
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆36Oct 6, 2021Updated 4 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆20Feb 8, 2026Updated last month
- My NLP datasets for Russian language☆386Feb 18, 2023Updated 3 years ago
- ☆31Sep 23, 2024Updated last year
- ☆14Apr 22, 2025Updated 10 months ago
- A list of initiatives for adding new languages to opensource machine translation models☆21Dec 2, 2025Updated 3 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆63Oct 7, 2024Updated last year
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Библиотека для извлечения статистик из текстов на русском языке.☆125Jan 21, 2023Updated 3 years ago
- Compact high quality word embeddings for Russian language☆217Jul 24, 2023Updated 2 years ago
- RuNNE☆12Jul 17, 2024Updated last year
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Oct 11, 2024Updated last year
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated 11 months ago
- Jupyter Notebooks and other files from my video tutorial series about GigaChat API☆78Dec 4, 2025Updated 3 months ago
- Links to Russian corpora + Python functions for loading and parsing☆310Feb 9, 2026Updated last month
- Deep Learning based NLP modeling for Russian language☆243Jul 24, 2023Updated 2 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,314Oct 17, 2024Updated last year