MashaPo / russtress
A Python package, russtress, that adds stress marks to Russian text
☆50Updated 4 years ago
Related projects
Alternatives and complementary repositories for russtress
- Accentor and transcriptor for Russian language☆118Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆116Updated 3 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆30Updated 3 months ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Updated 3 years ago
- A project for converting numbers written out in words in Russian.☆100Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated last month
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆57Updated 3 years ago
- Morphological parser for Russian that splits words into morphemes: prefixes, roots, infixes and postfixes☆14Updated 4 years ago
- SpaCy official Russian model proposal☆31Updated 3 years ago
- Custom Russian tokenizer for spaCy☆42Updated 5 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆152Updated 5 months ago
- 🔬 Cleaning junk out of datasets (normalization, preprocessing)☆40Updated 3 years ago
- Large silver standard Russian corpus with NER, morphology and syntax markup☆61Updated last year
- A simple BERT-based model for comma placement☆40Updated 4 years ago
- Comparing quality and performance of NLP systems for Russian language☆44Updated last year
- Experiments with grapheme2phoneme for Russian based on artificial neural networks☆20Updated 3 years ago
- Russian data from the SynTagRus corpus.☆80Updated 6 months ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆26Updated 2 months ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neura…☆23Updated 5 years ago
- ☆36Updated last year
- ☆34Updated 7 years ago
- ☆48Updated 6 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆19Updated 2 years ago
- Russian SuperGLUE benchmark☆108Updated last year
- ☆13Updated last year
- Russian language support for NLTK's PunktSentenceTokenizer☆52Updated 5 years ago
- Speech analytics package for call centers☆22Updated 3 years ago
- A neural network for restoring punctuation in Russian.☆20Updated 2 years ago