saarus72 / text_normalization
T5-based (Russian) text normalization
☆22 Updated last year
Alternatives and similar repositories for text_normalization
Users who are interested in text_normalization compare it to the libraries listed below
- Simple audio AE ☆12 Updated 9 months ago
- A simple text normalizer for use before speech synthesis ☆37 Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on the Russian language ☆43 Updated 5 months ago
- ☆13 Updated 2 years ago
- ☆53 Updated 2 weeks ago
- ☆13 Updated 4 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆62 Updated 10 months ago
- A simple IPA phonemizer based on ruaccent-encoder ☆23 Updated 4 months ago
- Normalize Text in Russian ☆27 Updated last year
- Train punctuation and capitalization models for different languages ☆25 Updated 3 years ago
- Effective LLM Alignment Toolkit ☆140 Updated 2 months ago
- Augmentex: a library for augmenting texts with errors ☆65 Updated last year
- Russian open TTS dataset ☆17 Updated 5 years ago
- Framework for processing and filtering datasets ☆27 Updated last year
- Top ML papers of the week. ☆39 Updated this week
- 🇷🇺 Production-ready punctuation restoration model for the Russian language 🇷🇺 ☆58 Updated 4 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress), a Python package for placing stress in Russian text using RNN (BiLST… ☆37 Updated last year
- ☆56 Updated 6 months ago
- ☆28 Updated 2 months ago
- Fine-tuning of the OpenAI Whisper base model for Russian on the Sber-golos dataset ☆40 Updated 2 years ago
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,… ☆50 Updated 4 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM ☆13 Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆158 Updated 8 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks. ☆33 Updated last year
- A simple BERT-based comma placement model ☆40 Updated 5 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆17 Updated 6 months ago
- A list of initiatives for adding new languages to open-source machine translation models ☆20 Updated 3 weeks ago
- A Python library and service for automatic speech recognition and transcription in Russian and English ☆61 Updated 8 months ago
- Deep Learning for Speech ☆93 Updated 7 months ago
- A GAN-based Russian-English vocoder ☆17 Updated 4 years ago