My NLP datasets for Russian language
☆386Feb 18, 2023Updated 3 years ago
Alternatives and similar repositories for NLP_Datasets
Users that are interested in NLP_Datasets are comparing it to the libraries listed below
Sorting:
- Links to Russian corpora + Python functions for loading and parsing☆309Feb 9, 2026Updated 2 weeks ago
- Russian language models for spaCy☆241Jul 14, 2021Updated 4 years ago
- Русскоязычный генеративный чатбот с профилем и фактами☆261Jan 20, 2023Updated 3 years ago
- A list of pretrained Transformer models for the Russian language.☆177Feb 3, 2020Updated 6 years ago
- A Russian data set for question answering over Wikidata☆49Jun 6, 2021Updated 4 years ago
- RuREBus shared task repo☆29Jan 18, 2021Updated 5 years ago
- Rule-based token, sentence segmentation for Russian language☆278Jul 24, 2023Updated 2 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- Deep Learning based NLP modeling for Russian language☆241Jul 24, 2023Updated 2 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆146Nov 19, 2022Updated 3 years ago
- The tiniest sentence encoder for Russian language☆247Jul 25, 2024Updated last year
- Accentor and transcriptor for Russian language☆133Jun 19, 2022Updated 3 years ago
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Compact high quality word embeddings for Russian language☆214Jul 24, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,312Oct 17, 2024Updated last year
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- Грамматический Словарь Русского Языка (+ английский, японский, etc)☆77Aug 10, 2020Updated 5 years ago
- Russian GPT3 models.☆2,094Dec 12, 2022Updated 3 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆29May 14, 2025Updated 9 months ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆41Oct 10, 2025Updated 4 months ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆105May 13, 2021Updated 4 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆24Aug 23, 2019Updated 6 years ago
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,…☆51Feb 16, 2021Updated 5 years ago
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆372Nov 24, 2021Updated 4 years ago
- ANYKS Spell-Checker☆19Jan 3, 2023Updated 3 years ago
- Experiments with grapheme2phoneme for Russian based on the artificial neural networks☆21Apr 1, 2021Updated 4 years ago
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- Russian RoBERTa☆31Nov 29, 2019Updated 6 years ago
- python package russtress accentuates russian text☆63May 13, 2020Updated 5 years ago
- Russian text segmenter and tokenizer☆18Mar 2, 2021Updated 4 years ago
- Augmentex — a library for augmenting texts with errors☆69Jul 3, 2024Updated last year
- Библиотека для извлечения статистик из текстов на русском языке.☆124Jan 21, 2023Updated 3 years ago
- Part-of-Speech Tagger for Russian language☆23Jul 29, 2020Updated 5 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆56May 27, 2023Updated 2 years ago
- Лемматизатор для русскоязычных текстов☆46Jun 4, 2020Updated 5 years ago
- Using transformers to generate Russian poetry☆37Aug 21, 2023Updated 2 years ago