Python interface to http://opencorpora.org/
☆45Oct 11, 2020Updated 5 years ago
Alternatives and similar repositories for opencorpora-tools
Users that are interested in opencorpora-tools are comparing it to the libraries listed below
Sorting:
- [experiment] CRF-based disambiguation engine for pymorphy2☆10May 9, 2016Updated 9 years ago
- Russian data from the SynTagRus corpus.☆86Nov 12, 2025Updated 3 months ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Oct 3, 2020Updated 5 years ago
- Scripts for updating pymorphy2 dictionaries☆37May 2, 2024Updated last year
- ☆51Nov 20, 2017Updated 8 years ago
- TextoKit - is a set of components for Natural Language Processing based on Apache UIMA platform.☆16Jul 6, 2016Updated 9 years ago
- Репозиторий спортивного направления DMIA, весна 2019☆23May 21, 2021Updated 4 years ago
- Solution for the Black Box Challenge☆10Jun 8, 2016Updated 9 years ago
- ☆13Nov 2, 2025Updated 4 months ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆29May 14, 2025Updated 9 months ago
- ☆33Sep 20, 2017Updated 8 years ago
- ☆33Dec 8, 2022Updated 3 years ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆42Oct 10, 2025Updated 4 months ago
- Extended Wikilinks dataset description☆15Apr 1, 2018Updated 7 years ago
- Морфологический анализатор русского языка☆42Jun 2, 2025Updated 9 months ago
- MLS is a set of tools to wrap your ML solutions in web services☆15Mar 19, 2018Updated 7 years ago
- The set of Apache UIMA addons & utilities.Some of them are language-independent. The others may be Russian language-specific.☆28Oct 8, 2021Updated 4 years ago
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- ☆30Dec 25, 2022Updated 3 years ago
- Morphological analyzer / inflection engine for Russian and Ukrainian languages.☆1,166Jun 26, 2024Updated last year
- Данные 6-го издания «Грамматического словаря русского языка» А. А. Зализняка (2010) в виде текстовых файлов☆25Sep 17, 2024Updated last year
- Python wrapper for PullEnti☆21Jul 31, 2020Updated 5 years ago
- Materials for Data Science Journey 2017☆39Aug 8, 2022Updated 3 years ago
- Python integration for the GATE framework☆21Nov 6, 2024Updated last year
- A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary …☆292Feb 9, 2022Updated 4 years ago
- ☆24Jun 25, 2025Updated 8 months ago
- ☆56May 12, 2018Updated 7 years ago
- Habrahabr API Client Library for Python☆37Jun 20, 2014Updated 11 years ago
- Russian GPT2 model☆61Jul 12, 2021Updated 4 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆73Jul 24, 2023Updated 2 years ago
- Common scripts, mainly for text processing and experimental control☆20Aug 24, 2012Updated 13 years ago
- A list of pretrained Transformer models for the Russian language.☆177Feb 3, 2020Updated 6 years ago
- Code for morphological transformations☆29Jun 3, 2017Updated 8 years ago
- Declarative validation with async validators support☆12Aug 27, 2018Updated 7 years ago
- collection with description of super-resolution related papers, repositories, datasets, loss functions and etc.☆11Dec 12, 2023Updated 2 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Mar 19, 2019Updated 6 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆146Nov 19, 2022Updated 3 years ago
- Seman is a set of linguistic tools to analyze Russian or German texts, it contains lexicons and grammars. The project is interesting as a…☆92Feb 27, 2025Updated last year