The tiniest sentence encoder for Russian language
☆247Jul 25, 2024Updated last year
Alternatives and similar repositories for encodechka
Users that are interested in encodechka are comparing it to the libraries listed below
Sorting:
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆165Dec 8, 2025Updated 2 months ago
- Effective LLM Alignment Toolkit☆152Jun 25, 2025Updated 8 months ago
- Language modeling and instruction tuning for Russian☆466Aug 20, 2024Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆46Mar 20, 2025Updated 11 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Mar 16, 2023Updated 2 years ago
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- Rule-based token, sentence segmentation for Russian language☆278Jul 24, 2023Updated 2 years ago
- ☆34Apr 14, 2025Updated 10 months ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- Tools and agents for automated research.☆50Dec 5, 2025Updated 2 months ago
- My NLP datasets for Russian language☆387Feb 18, 2023Updated 3 years ago
- Augmentex — a library for augmenting texts with errors☆69Jul 3, 2024Updated last year
- ☆414Oct 23, 2024Updated last year
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆36Oct 6, 2021Updated 4 years ago
- Library for industrial alignment.☆405Sep 24, 2025Updated 5 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Oct 7, 2024Updated last year
- RuNNE☆12Jul 17, 2024Updated last year
- Deep Learning based NLP modeling for Russian language☆241Jul 24, 2023Updated 2 years ago
- ⚡ Набор решений для разработки LLM-приложений на русском языке с поддержкой GigaChat ⚡☆543Dec 22, 2025Updated 2 months ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆35Jul 16, 2022Updated 3 years ago
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Oct 11, 2024Updated last year
- Links to Russian corpora + Python functions for loading and parsing☆309Feb 9, 2026Updated 2 weeks ago
- A list of initiatives for adding new languages to opensource machine translation models☆21Dec 2, 2025Updated 2 months ago
- A new second practical assignment for Huawei's NLP course☆19Mar 20, 2024Updated last year
- Библиотека для извлечения статистик из текстов на русском языке.☆124Jan 21, 2023Updated 3 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆39Feb 19, 2026Updated last week
- ☆28Jan 13, 2026Updated last month
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated 10 months ago
- Compact high quality word embeddings for Russian language☆213Jul 24, 2023Updated 2 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,312Oct 17, 2024Updated last year
- The project to find correlation between tweets and future stock prices☆12Feb 28, 2023Updated 3 years ago
- Русскоязычный генеративный чатбот с профилем и фактами☆261Jan 20, 2023Updated 3 years ago
- Tools for shrinking fastText models (in gensim format)☆183May 3, 2024Updated last year
- Материалы курса на Stepik "Нейронные сети и обработка текста"☆186Mar 22, 2024Updated last year
- ☆18Jun 18, 2021Updated 4 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆36Mar 2, 2025Updated 11 months ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago