Corpus of Russian news articles collected from Lenta.Ru
☆146Nov 19, 2022Updated 3 years ago
Alternatives and similar repositories for Lenta.Ru-News-Dataset
Users that are interested in Lenta.Ru-News-Dataset are comparing it to the libraries listed below
Sorting:
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ☆93Apr 4, 2017Updated 8 years ago
- ☆56May 12, 2018Updated 7 years ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆57Dec 8, 2016Updated 9 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Oct 3, 2020Updated 5 years ago
- Links to Russian corpora + Python functions for loading and parsing☆309Feb 9, 2026Updated last month
- Краулеры для проекта Taiga Corpus и Taiga Parser, скачивание ресурсов из открытых источников☆14Apr 9, 2019Updated 6 years ago
- ☆51Nov 20, 2017Updated 8 years ago
- Russian language models for spaCy☆241Jul 14, 2021Updated 4 years ago
- Dataset collected from popular Russian collective blog Habrahabr.ru☆13Oct 24, 2016Updated 9 years ago
- Classification and aggregation of russian news articles. University coursework.☆18Jan 21, 2019Updated 7 years ago
- ☆36Dec 8, 2022Updated 3 years ago
- Datasets for evaluation of keyword extraction in Russian☆31Sep 23, 2020Updated 5 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- Russian data from the SynTagRus corpus.☆86Nov 12, 2025Updated 3 months ago
- Topic modeling with BigARTM: an interactive book☆60Dec 5, 2018Updated 7 years ago
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆371Nov 24, 2021Updated 4 years ago
- Deep Learning based NLP modeling for Russian language☆242Jul 24, 2023Updated 2 years ago
- ☆33Sep 20, 2017Updated 8 years ago
- My NLP datasets for Russian language☆386Feb 18, 2023Updated 3 years ago
- Russian NLP datasets☆16Oct 17, 2018Updated 7 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Rule-based facts extraction for Russian language☆330Jul 24, 2023Updated 2 years ago
- Sberbank Data Science Contest 2017. Задача B: построение вопрос-ответной системы.☆11Nov 7, 2018Updated 7 years ago
- Database for experiments with russian voxforge audio data (http://voxforge.org/ru/downloads).☆14Aug 31, 2021Updated 4 years ago
- Open Source framework for developing Dialog Agents☆20Mar 26, 2018Updated 7 years ago
- Clickbait Language Model with Self-Attention☆21Jul 27, 2018Updated 7 years ago
- Differentiable lower bound for BLEU score.☆12Apr 13, 2019Updated 6 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆105May 13, 2021Updated 4 years ago
- Rule-based token, sentence segmentation for Russian language☆279Jul 24, 2023Updated 2 years ago
- Rekko Challenge 2nd place☆23Jun 6, 2019Updated 6 years ago
- TextoKit - is a set of components for Natural Language Processing based on Apache UIMA platform.☆16Jul 6, 2016Updated 9 years ago
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision☆93Apr 12, 2022Updated 3 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Mar 12, 2025Updated 11 months ago
- ConvAI baseline solution☆50Jul 12, 2017Updated 8 years ago
- Materials for Data Science Journey 2017☆39Aug 8, 2022Updated 3 years ago
- Named Entity Recognition☆336May 22, 2023Updated 2 years ago
- Sentiment analysis library for russian language☆321Oct 30, 2023Updated 2 years ago
- Репозиторий для лекций, семинаров и заданий по курсу "Анализ неструктурированных данных" ФКН ВШЭ☆34Dec 5, 2018Updated 7 years ago