Rule-based token and sentence segmentation for the Russian language
☆280Jul 24, 2023Updated 2 years ago
Alternatives and similar repositories for razdel
Users who are interested in razdel are comparing it to the libraries listed below.
- Deep Learning based NLP modeling for Russian language☆243Jul 24, 2023Updated 2 years ago
- Compact high quality word embeddings for Russian language☆217Jul 24, 2023Updated 2 years ago
- Rule-based facts extraction for Russian language☆331Jul 24, 2023Updated 2 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Links to Russian corpora + Python functions for loading and parsing☆310Feb 9, 2026Updated last month
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,316Oct 17, 2024Updated last year
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- ☆51Nov 20, 2017Updated 8 years ago
- NER, syntax markup visualizations☆140Feb 9, 2026Updated last month
- Comparing quality and performance of NLP systems for Russian language☆50Jul 24, 2023Updated 2 years ago
- Large silver-standard Russian corpus with NER, morphology and syntax markup☆73Jul 24, 2023Updated 2 years ago
- Russian language models for spaCy☆241Jul 14, 2021Updated 4 years ago
- ☆18Jun 18, 2021Updated 4 years ago
- ☆56May 12, 2018Updated 7 years ago
- Russian data from the SynTagRus corpus.☆86Nov 12, 2025Updated 4 months ago
- Crawlers for the Taiga Corpus and Taiga Parser project; downloads resources from open sources☆14Apr 9, 2019Updated 6 years ago
- A list of pretrained Transformer models for the Russian language.☆177Feb 3, 2020Updated 6 years ago
- Sentiment analysis library for Russian language☆320Oct 30, 2023Updated 2 years ago
- Python wrapper for PullEnti☆21Jul 31, 2020Updated 5 years ago
- ☆28Jan 13, 2026Updated 2 months ago
- My NLP datasets for Russian language☆386Feb 18, 2023Updated 3 years ago
- A library for extracting statistics from Russian-language texts.☆125Jan 21, 2023Updated 3 years ago
- ☆35Sep 20, 2017Updated 8 years ago
- Datasets for evaluation of keyword extraction in Russian☆31Sep 23, 2020Updated 5 years ago
- 🔬 Cleaning junk out of datasets (normalization, preprocessing)☆41Mar 18, 2021Updated 5 years ago
- The tiniest sentence encoder for Russian language☆246Jul 25, 2024Updated last year
- Morphological analyzer / inflection engine for Russian and Ukrainian languages.☆1,169Jun 26, 2024Updated last year
- Topic modeling with BigARTM: an interactive book☆60Dec 5, 2018Updated 7 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆164Dec 8, 2025Updated 3 months ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Mar 12, 2025Updated last year
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Unsupervised text tokenizer focused on computational efficiency☆977Mar 29, 2024Updated last year
- Morphological analyzer / inflection engine for Russian and Ukrainian languages.☆129Oct 9, 2025Updated 5 months ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆123Mar 15, 2021Updated 5 years ago
- Russian language support for NLTK's PunktSentenceTokenizer☆55Jul 10, 2019Updated 6 years ago
- Accentor and transcriptor for Russian language☆134Jun 19, 2022Updated 3 years ago
- A list of initiatives for adding new languages to opensource machine translation models☆21Dec 2, 2025Updated 3 months ago
- This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, and Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated 11 months ago
- Jupyter Widget for data annotation☆140Jan 6, 2023Updated 3 years ago