aatimofeev / spacy_russian_tokenizer
Custom Russian tokenizer for spaCy
☆44May 14, 2019Updated 6 years ago
Alternatives and similar repositories for spacy_russian_tokenizer
Users that are interested in spacy_russian_tokenizer are comparing it to the libraries listed below
- Russian language models for spaCy☆241Jul 14, 2021Updated 4 years ago
- Simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Find out whether people are saying good or bad things about you or your company on the Internet! Our AI-powered "Сорока" will bring it to you on its …☆19May 24, 2018Updated 7 years ago
- Russian RoBERTa☆29Nov 29, 2019Updated 6 years ago
- ☆18May 8, 2018Updated 7 years ago
- Russian Law as Open Data☆48Feb 5, 2026Updated last week
- Named entity recognition (NER) in Russian texts☆41Oct 10, 2025Updated 4 months ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Comparing quality and performance of NLP systems for Russian language☆51Jul 24, 2023Updated 2 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- ☆56May 12, 2018Updated 7 years ago
- RuREBus shared task repo☆29Jan 18, 2021Updated 5 years ago
- Large silver-standard Russian corpus with NER, morphology, and syntax markup☆72Jul 24, 2023Updated 2 years ago
- Compact high quality word embeddings for Russian language☆210Jul 24, 2023Updated 2 years ago
- A project for converting numbers written as text in Russian.☆11Apr 5, 2022Updated 3 years ago
- SpaCy official Russian model proposal☆32Jan 24, 2021Updated 5 years ago
- A Python program, running as an independent process, that provides a 'proxy-like' service for experiment runtimes (psychopy) and device…☆19May 8, 2013Updated 12 years ago
- Unofficial ontologies for Official Registers of Russian Federal Tax Service☆10Apr 7, 2018Updated 7 years ago
- 20 python libs and more: read me first!☆12Apr 11, 2024Updated last year
- Simple program that hides/shows the icons on the Windows desktop.☆11Sep 29, 2023Updated 2 years ago
- Hungarian tokenizer.☆14Mar 15, 2022Updated 3 years ago
- Vue component for easily selecting time intervals. Available on npm. 2018☆16Mar 1, 2019Updated 6 years ago
- Examples of distributed machine learning using the AICloud service☆37Nov 18, 2025Updated 2 months ago
- Didactic Web crawler for Web Search Engines (CS 6913) course at NYU☆10Dec 8, 2022Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- ☆10Jan 12, 2023Updated 3 years ago
- A distributed network based on hash codes and lattices.☆14Aug 16, 2016Updated 9 years ago
- Qt for Python workshop☆11Nov 23, 2021Updated 4 years ago
- An aggregated project of artificial intelligence and machine learning methods☆11Oct 16, 2017Updated 8 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- PostgreSQL extension for working with AcoustID fingerprints☆13Nov 13, 2022Updated 3 years ago
- Visual SPARQL query tool☆10Feb 26, 2016Updated 9 years ago
- A copy of m^2's fsbench (https://chiselapp.com/user/Justin_be_my_guide/repository/fsbench/) with the latest density updates☆13Dec 30, 2019Updated 6 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- Highlight resources and tools for Nvidia Jetson Nano☆10Apr 22, 2019Updated 6 years ago
- Write JDBC ResultSet to Parquet File☆11Apr 14, 2025Updated 10 months ago
- Basic chat example to demonstrate I/O and other rust features.☆13Nov 10, 2025Updated 3 months ago
- 🔬 Cleaning junk out of datasets (normalization, preprocessing)☆41Mar 18, 2021Updated 4 years ago
- This is a Postfix content filter written in the Bash scripting language which automatically encrypts email messages with email encryption stan…☆10Nov 8, 2021Updated 4 years ago