Koziev / rutokenizer
Russian text segmenter and tokenizer
☆15Updated 3 years ago
Alternatives and similar repositories for rutokenizer:
Users who are interested in rutokenizer are comparing it to the libraries listed below
- 🇷🇺 Production-ready punctuation restoration model for the Russian language 🇷🇺☆58Updated 3 years ago
- Part-of-Speech Tagger for Russian language☆21Updated 4 years ago
- Russian Text Expansion based on ruGPT3Large☆25Updated 2 years ago
- A simple BERT-based model for comma placement☆40Updated 4 years ago
- Deep Learning based NLP modeling for Russian language☆228Updated last year
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆118Updated 3 years ago
- ☆120Updated last year
- A library for extracting statistics from Russian-language texts.☆115Updated 2 years ago
- Generate questions based on text in Russian☆28Updated 3 years ago
- ☆57Updated last year
- 🔬 Cleaning junk out of datasets (normalization, preprocessing)☆39Updated 3 years ago
- Library for Russian rap generation.☆23Updated 3 years ago
- Russian SuperGLUE benchmark☆109Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆147Updated 2 months ago
- A Python wrapper for the RuWordNet thesaurus☆59Updated 2 months ago
- Russian GPT2 model☆59Updated 3 years ago
- A project for converting numbers written out as text in Russian.☆103Updated 3 years ago
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- Foundational Model for Speech Recognition Tasks☆172Updated 2 months ago
- Lemmatizer for Russian-language texts☆44Updated 4 years ago
- Python package russtress, which places stress marks in Russian text☆51Updated 4 years ago
- ☆211Updated 3 years ago
- Accentor and transcriptor for Russian language☆122Updated 2 years ago
- Compact high quality word embeddings for Russian language☆192Updated last year
- Large silver-standard Russian corpus with NER, morphology, and syntax markup☆63Updated last year
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,…☆52Updated 4 years ago
- A simple text normalizer for use before speech synthesis☆25Updated 9 months ago
- Tool for information extraction from Russian texts☆5Updated 3 months ago
- Rule-based token and sentence segmentation for the Russian language☆257Updated last year
- A simple stress placement tool with homograph handling☆109Updated 3 months ago