Custom Russian tokenizer for spaCy
☆44May 14, 2019Updated 6 years ago
Alternatives and similar repositories for spacy_russian_tokenizer
Users that are interested in spacy_russian_tokenizer are comparing it to the libraries listed below
Sorting:
- Russian language models for spaCy☆241Jul 14, 2021Updated 4 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- nlp workshop at datafest siberia 2019☆22Dec 8, 2022Updated 3 years ago
- Pre-trained models for tokenization, sentence segmentation and so on☆15Aug 22, 2017Updated 8 years ago
- Russian RoBERTa☆31Nov 29, 2019Updated 6 years ago
- Russian Law as Open Data☆49Feb 5, 2026Updated last month
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Part-of-Speech Tagger for Russian language☆23Jul 29, 2020Updated 5 years ago
- ☆51Nov 20, 2017Updated 8 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Links to Russian corpora + Python functions for loading and parsing☆309Feb 9, 2026Updated 3 weeks ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆57Dec 8, 2016Updated 9 years ago
- RuREBus shared task repo☆29Jan 18, 2021Updated 5 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Oct 3, 2020Updated 5 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆73Jul 24, 2023Updated 2 years ago
- Compact high quality word embeddings for Russian language☆214Jul 24, 2023Updated 2 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Apr 5, 2022Updated 3 years ago
- A Python program, running as an independent process, that provides a 'proxy like' service for experiment runtimes ( psychopy ) and device…☆19May 8, 2013Updated 12 years ago
- Unofficial ontologies for Official Registers of Russian Federal Tax Service☆10Apr 7, 2018Updated 7 years ago
- Utility to run separate X with discrete nvidia graphics with full performance adapted to work on Debian 9. in a Lenovo Yoga☆11Dec 20, 2018Updated 7 years ago
- Didactic Web crawler for Web Search Engines (CS 6913) course at NYU☆10Dec 8, 2022Updated 3 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆12Jul 15, 2016Updated 9 years ago
- SpaCy official Russian model proposal☆32Jan 24, 2021Updated 5 years ago
- Vue component to easy select time intervals. Available at npm. 2018☆15Mar 1, 2019Updated 7 years ago
- CLI tool to automatically crawl and download all photos on a media platform without having to visit and save each photo manually, saving …☆12Feb 20, 2016Updated 10 years ago
- Hungarian tokenizer.☆14Mar 15, 2022Updated 3 years ago
- Simple program that hide/show the icons of Windows desktop.☆11Sep 29, 2023Updated 2 years ago
- 20 python libs and more: read me first!☆12Apr 11, 2024Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Scrape financial terms from Investopedia☆12Sep 7, 2018Updated 7 years ago
- Learning to generate an image of the Mona Lisa, pixel-by-pixel, using a deep neural network.☆17May 4, 2013Updated 12 years ago
- Database for experiments with russian voxforge audio data (http://voxforge.org/ru/downloads).☆14Aug 31, 2021Updated 4 years ago
- Javascript bayesian network library, inference, learning☆15Nov 8, 2019Updated 6 years ago
- Basic chat example to demonstrate I/O and other rust features.☆13Nov 10, 2025Updated 3 months ago
- This is a postfix content filter written in BASH scripting language which automatically encrypt email messages with email encryption stan…☆10Nov 8, 2021Updated 4 years ago
- a small collection of models implemented in keras, including matrix factorization(recommendation system), topic modeling, text classifica…☆14Jul 12, 2017Updated 8 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago