Custom Russian tokenizer for spaCy
☆44May 14, 2019Updated 6 years ago
Alternatives and similar repositories for spacy_russian_tokenizer
Users that are interested in spacy_russian_tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Russian language models for spaCy☆242Jul 14, 2021Updated 4 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- nlp workshop at datafest siberia 2019☆22Dec 8, 2022Updated 3 years ago
- Morphological Analyzer for Russian 💬☆40Jul 14, 2021Updated 4 years ago
- ☆16Sep 3, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18May 8, 2018Updated 7 years ago
- Узнай, хорошо или плохо говорят о тебе или твоей фирме в Интернете! Наша "Сорока" с искусственным интеллектом принесёт тебе это на своём …☆19May 24, 2018Updated 7 years ago
- RuREBus shared task repo☆29Jan 18, 2021Updated 5 years ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆42Oct 10, 2025Updated 6 months ago
- Russian Law as Open Data☆58Apr 27, 2026Updated last week
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆58Dec 8, 2016Updated 9 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Oct 3, 2020Updated 5 years ago
- Links to Russian corpora + Python functions for loading and parsing☆312Apr 21, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Part-of-Speech Tagger for Russian language☆23Jul 29, 2020Updated 5 years ago
- [experiment] CRF-based disambiguation engine for pymorphy2☆10May 9, 2016Updated 9 years ago
- ☆51Nov 20, 2017Updated 8 years ago
- HFST optimized-lookup standalone library and command line tool☆13Feb 27, 2018Updated 8 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆19Apr 8, 2026Updated 3 weeks ago
- a small collection of models implemented in keras, including matrix factorization(recommendation system), topic modeling, text classifica…☆14Jul 12, 2017Updated 8 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Apr 5, 2022Updated 4 years ago
- Compact high quality word embeddings for Russian language☆218Apr 13, 2026Updated 3 weeks ago
- Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper☆14Apr 3, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Программирование и теория алгоритмов 2019-2020, ФиКЛ ВШЭ☆12Jun 9, 2020Updated 5 years ago
- Topic modeling with BigARTM: an interactive book☆61Dec 5, 2018Updated 7 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,327Apr 13, 2026Updated 3 weeks ago
- Hungarian tokenizer.☆15Mar 15, 2022Updated 4 years ago
- Decorator class implementation for Python☆12Mar 3, 2017Updated 9 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆74Apr 13, 2026Updated 3 weeks ago
- A system for word sense induction and disambiguation based on JoBimText approach☆16Feb 14, 2018Updated 8 years ago
- Rule-based token, sentence segmentation for Russian language☆281Apr 13, 2026Updated 3 weeks ago
- ☆14Dec 20, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆30Aug 25, 2021Updated 4 years ago
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆42Mar 18, 2021Updated 5 years ago
- Веб-версия "Грамматического словаря" А. А. Зализняка☆22Jan 7, 2026Updated 3 months ago
- My NLP datasets for Russian language☆390Feb 18, 2023Updated 3 years ago
- ☆18Oct 6, 2022Updated 3 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Feb 11, 2018Updated 8 years ago
- A miner to do a sociological data-mining from vk.com☆12Jan 26, 2016Updated 10 years ago