nlpub / pymystem3Links
A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggest…
☆294Updated 3 years ago
Alternatives and similar repositories for pymystem3
Users that are interested in pymystem3 are comparing it to the libraries listed below
Sorting:
- Rule-based facts extraction for Russian language☆324Updated 2 years ago
- Sentiment analysis library for russian language☆316Updated last year
- Russian language models for spaCy☆240Updated 4 years ago
- Rule-based token, sentence segmentation for Russian language☆271Updated 2 years ago
- Корпус ненормативной лексики русского языка для нужд NLP. Любые исправления и дополнения приветствуются☆137Updated 5 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆142Updated 2 years ago
- Term extraction for Russian language☆89Updated 6 years ago
- Deep Learning based NLP modeling for Russian language☆235Updated 2 years ago
- Links to Russian corpora + Python functions for loading and parsing☆302Updated 2 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆155Updated last year
- Библиотека для анализа и генерации стихов на русском языке☆177Updated last year
- Morphological analyzer / inflection engine for Russian and Ukrainian languages.☆1,154Updated last year
- Compact high quality word embeddings for Russian language☆202Updated 2 years ago
- Попытка сделать свой GLR-парсер для русского языка на Python☆142Updated 8 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,268Updated 10 months ago
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆370Updated 3 years ago
- Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ☆89Updated 8 years ago
- Russian language support for NLTK's PunktSentenceTokenizer☆55Updated 6 years ago
- A web-based engine for creating and annotating textual corpora☆247Updated 2 years ago
- ☆496Updated 4 years ago
- Russian names parsers, gender identification and processing tools☆133Updated last year
- Russian stopwords collection☆74Updated 3 years ago
- My NLP datasets for Russian language☆375Updated 2 years ago
- A Python wrapper for the RuWordNet thesaurus☆68Updated 9 months ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆70Updated 2 years ago
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆40Updated 4 years ago
- A list of pretrained Transformer models for the Russian language.☆174Updated 5 years ago
- ☆28Updated 2 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Updated 7 years ago
- Библиотека для извлечения статистик из текстов на русском языке.☆123Updated 2 years ago