Simple and fast rule-based sentence segmentation. Tested on the OpenCorpora and SynTagRus datasets.
☆52 · Jul 4, 2018 · Updated 7 years ago
Alternatives and similar repositories for ru_sentence_tokenizer
Users that are interested in ru_sentence_tokenizer are comparing it to the libraries listed below.
- [experiment] CRF-based disambiguation engine for pymorphy2 ☆10 · May 9, 2016 · Updated 9 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems. ☆158 · May 22, 2024 · Updated last year
- ☆56 · May 12, 2018 · Updated 7 years ago
- Crawlers for the Taiga Corpus and Taiga Parser projects; downloads resources from open sources ☆14 · Apr 9, 2019 · Updated 7 years ago
- ☆30 · Dec 25, 2022 · Updated 3 years ago
- Russian language models for spaCy ☆242 · Jul 14, 2021 · Updated 4 years ago
- Show summary of a large number of URLs in a Jupyter Notebook ☆19 · Apr 8, 2026 · Updated 3 weeks ago
- ☆36 · Dec 8, 2022 · Updated 3 years ago
- A list of pretrained Transformer models for the Russian language. ☆177 · Feb 3, 2020 · Updated 6 years ago
- Python wrapper for PullEnti ☆21 · Jul 31, 2020 · Updated 5 years ago
- Gazeta: Dataset for automatic summarization of Russian news ☆37 · Oct 6, 2021 · Updated 4 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more ☆59 · Oct 3, 2020 · Updated 5 years ago
- [UNSUPPORTED] - please use https://github.com/kmike/pymorphy2. Russian and English morphology analyser (POS tagger + inflection engine) w… ☆41 · Jul 23, 2015 · Updated 10 years ago
- RuREBus shared task repo ☆29 · Jan 18, 2021 · Updated 5 years ago
- Compact high quality word embeddings for Russian language ☆218 · Apr 13, 2026 · Updated 3 weeks ago
- NLP workshop at DataFest Siberia 2019 ☆22 · Dec 8, 2022 · Updated 3 years ago
- Russian data from the SynTagRus corpus. ☆86 · Nov 12, 2025 · Updated 5 months ago
- ☆87 · Oct 19, 2022 · Updated 3 years ago
- ☆35 · Sep 20, 2017 · Updated 8 years ago
- Dataset collected from popular Russian collective blog Habrahabr.ru ☆13 · Oct 24, 2016 · Updated 9 years ago
- System for automatic pronominal resolution for Russian ☆14 · Apr 3, 2020 · Updated 6 years ago
- ANYKS Spell-Checker ☆19 · Jan 3, 2023 · Updated 3 years ago
- Accentor and transcriptor for Russian language ☆136 · Jun 19, 2022 · Updated 3 years ago
- ☆21 · Jul 28, 2020 · Updated 5 years ago
- Links to Russian corpora + Python functions for loading and parsing ☆312 · Apr 21, 2026 · Updated last week
- Topic modeling with BigARTM: an interactive book ☆61 · Dec 5, 2018 · Updated 7 years ago
- Mini-library for producing graph visualizations from embedding models ☆28 · Sep 10, 2020 · Updated 5 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc. ☆56 · May 27, 2023 · Updated 2 years ago
- a tor socks proxy docker image ☆12 · Apr 8, 2026 · Updated 3 weeks ago
- Russian SuperGLUE benchmark ☆112 · Jun 12, 2023 · Updated 2 years ago
- SpaCy official Russian model proposal ☆32 · Jan 24, 2021 · Updated 5 years ago
- Large silver standard Russian corpus with NER, morphology and syntax markup ☆74 · Apr 13, 2026 · Updated 3 weeks ago
- Russian names parsers, gender identification and processing tools ☆138 · Dec 6, 2023 · Updated 2 years ago
- ☆16 · Apr 10, 2026 · Updated 3 weeks ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆17 · Mar 4, 2020 · Updated 6 years ago
- ☆10 · Jul 21, 2017 · Updated 8 years ago
- ☆18 · May 8, 2018 · Updated 7 years ago
- My NLP datasets for Russian language ☆390 · Feb 18, 2023 · Updated 3 years ago
- A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary … ☆293 · Feb 9, 2022 · Updated 4 years ago