Koziev / rutokenizer
Russian text segmenter and tokenizer
☆16Updated 4 years ago
Alternatives and similar repositories for rutokenizer:
Users that are interested in rutokenizer are comparing it to the libraries listed below
- Russian Text Expansion based on ruGPT3Large☆25Updated 2 years ago
- Part-of-Speech Tagger for Russian language☆21Updated 4 years ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆58Updated 3 years ago
- Generate questions based on text in Russian☆28Updated 3 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆103Updated 3 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆119Updated 4 years ago
- ☆57Updated last year
- Deep Learning based NLP modeling for Russian language☆230Updated last year
- python package russtress accentuates russian text☆52Updated 4 years ago
- Russian SuperGLUE benchmark☆109Updated last year
- ☆212Updated 3 years ago
- Библиотека для извлечения статистик из текстов на русском языке.☆118Updated 2 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆64Updated last year
- Простая модель расстановки запятых на основе BERT☆40Updated 4 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆47Updated last month
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆151Updated 3 months ago
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆40Updated 4 years ago
- ☆121Updated this week
- Reproducing http://kingjamesprogramming.tumblr.com and having fun.☆43Updated 5 years ago
- Foundational Model for Speech Recognition Tasks☆189Updated 3 weeks ago
- A Python wrapper for the RuWordNet thesaurus☆60Updated 4 months ago
- Лемматизатор для русскоязычных текстов☆44Updated 4 years ago
- Tacotron2 + Waveglow Russian☆43Updated 5 years ago
- Library for Russian rap generation.☆23Updated 3 years ago
- Using transformers to generate Russian poetry☆35Updated last year
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆26Updated 2 years ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated last year
- CLIP implementation for Russian language☆144Updated last year
- Rule-based token, sentence segmentation for Russian language☆261Updated last year
- Accentor and transcriptor for Russian language☆123Updated 2 years ago