A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary, and this library makes it easy to integrate into Python projects. Let us know in the issues if you would like to be involved in the development or maintenance of this project. If you have any fix or suggest…
☆292 · Feb 9, 2022 · Updated 4 years ago
Alternatives and similar repositories for pymystem3
Users that are interested in pymystem3 are comparing it to the libraries listed below.
- Morphological analyzer / inflection engine for Russian and Ukrainian languages. ☆1,169 · Jun 26, 2024 · Updated last year
- Solves basic Russian NLP tasks, API for lower level Natasha projects ☆1,316 · Oct 17, 2024 · Updated last year
- Rule-based facts extraction for Russian language ☆331 · Jul 24, 2023 · Updated 2 years ago
- ☆499 · Nov 16, 2020 · Updated 5 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets. ☆52 · Jul 4, 2018 · Updated 7 years ago
- Russian data from the SynTagRus corpus. ☆86 · Nov 12, 2025 · Updated 4 months ago
- ☆51 · Nov 20, 2017 · Updated 8 years ago
- Deep Learning based NLP modeling for Russian language ☆243 · Jul 24, 2023 · Updated 2 years ago
- Russian morphological tagset converters library. ☆42 · Oct 4, 2019 · Updated 6 years ago
- Corpus of Russian news articles collected from Lenta.Ru ☆145 · Nov 19, 2022 · Updated 3 years ago
- Russian language models for spaCy ☆241 · Jul 14, 2021 · Updated 4 years ago
- A list of pretrained Transformer models for the Russian language. ☆177 · Feb 3, 2020 · Updated 6 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems. ☆157 · May 22, 2024 · Updated last year
- Evaluation tools for the RUSSE evaluation campaign. ☆37 · Jun 11, 2017 · Updated 8 years ago
- Sentiment analysis library for the Russian language ☆320 · Oct 30, 2023 · Updated 2 years ago
- Python interface to http://opencorpora.org/ ☆45 · Oct 11, 2020 · Updated 5 years ago
- Rule-based token, sentence segmentation for Russian language ☆279 · Jul 24, 2023 · Updated 2 years ago
- Crawlers for the Taiga Corpus and Taiga Parser project; they download resources from open sources ☆14 · Apr 9, 2019 · Updated 6 years ago
- Yandex Mystem performs morphological analysis of Russian text ☆28 · Feb 15, 2018 · Updated 8 years ago
- Russian SuperGLUE benchmark ☆112 · Jun 12, 2023 · Updated 2 years ago
- Materials for Data Science Journey 2017 ☆39 · Aug 8, 2022 · Updated 3 years ago
- Links to Russian corpora + Python functions for loading and parsing ☆310 · Feb 9, 2026 · Updated last month
- Compact high quality word embeddings for Russian language ☆217 · Jul 24, 2023 · Updated 2 years ago
- Part-of-Speech Tagger for Russian language ☆23 · Jul 29, 2020 · Updated 5 years ago
- ANYKS Spell-Checker ☆19 · Jan 3, 2023 · Updated 3 years ago
- Python package russtress accentuates Russian text ☆65 · May 13, 2020 · Updated 5 years ago
- Large silver-standard Russian corpus with NER, morphology and syntax markup ☆73 · Jul 24, 2023 · Updated 2 years ago
- "Rossiya Segodnya" news dataset ☆46 · Sep 25, 2019 · Updated 6 years ago
- My NLP datasets for Russian language ☆386 · Feb 18, 2023 · Updated 3 years ago
- A Parallel Russian-Simple Russian Dataset ☆15 · Mar 30, 2023 · Updated 2 years ago
- Russian language support for NLTK's PunktSentenceTokenizer ☆55 · Jul 10, 2019 · Updated 6 years ago
- Named Entity Recognition ☆337 · May 22, 2023 · Updated 2 years ago
- Russian mass media stemmed texts corpus / Corpus of lemmatized (morphologically normalized) texts from Russian mass media ☆93 · Apr 4, 2017 · Updated 8 years ago
- ☆34 · Sep 20, 2017 · Updated 8 years ago
- Morphological analyzer / inflection engine for Russian and Ukrainian languages. ☆129 · Oct 9, 2025 · Updated 5 months ago
- ☆10 · Jul 21, 2017 · Updated 8 years ago
- ☆18 · Apr 25, 2018 · Updated 7 years ago
- TextoKit is a set of components for Natural Language Processing based on the Apache UIMA platform. ☆16 · Jul 6, 2016 · Updated 9 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences ☆33 · Sep 23, 2021 · Updated 4 years ago