Koziev / rutokenizerLinks
Russian text segmenter and tokenizer
☆18Updated 4 years ago
Alternatives and similar repositories for rutokenizer
Users that are interested in rutokenizer are comparing it to the libraries listed below
Sorting:
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆58Updated 4 years ago
- Russian SuperGLUE benchmark☆112Updated 2 years ago
- python package russtress accentuates russian text☆59Updated 5 years ago
- ☆57Updated last year
- Deep Learning based NLP modeling for Russian language☆239Updated 2 years ago
- Rule-based token, sentence segmentation for Russian language☆276Updated 2 years ago
- ☆212Updated 4 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆105Updated 4 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆122Updated 4 years ago
- Russian GPT2 model☆60Updated 4 years ago
- Compact high quality word embeddings for Russian language☆206Updated 2 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆71Updated 2 years ago
- Простая модель расстановки запятых на основе BERT☆40Updated 5 years ago
- Код для файнтюна LM (rugpt, LLaMa, FRED T5) средствами transformers + deepspeed + LoRa☆15Updated 2 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆162Updated 11 months ago
- Accentor and transcriptor for Russian language☆128Updated 3 years ago
- Библиотека для извлечения статистик из текстов на русском языке.☆124Updated 2 years ago
- CLIP implementation for Russian language☆147Updated 2 years ago
- Russian/English/Estonian/Finnish/Swedish phonetic algorithm based on Soundex and Metaphone☆52Updated 8 months ago
- Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge☆58Updated 3 years ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated 2 years ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Updated 2 years ago
- A Python wrapper for the RuWordNet thesaurus☆70Updated 11 months ago
- My NLP datasets for Russian language☆380Updated 2 years ago
- Russian Text Expansion based on ruGPT3Large☆25Updated 3 years ago
- Русскоязычный генеративный чатбот с профилем и фактами☆262Updated 2 years ago
- ☆134Updated 5 months ago
- Foundational Model for Speech Recognition Tasks☆327Updated 3 months ago
- Using transformers to generate Russian poetry☆36Updated 2 years ago
- Training GPT-2 on a Russian language corpus☆87Updated 4 years ago