saarus72 / text_normalization
T5-based (russian) text normalization
☆19Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for text_normalization
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆24Updated 3 weeks ago
- ☆13Updated last week
- ☆13Updated 3 years ago
- Простой нормализатор текстов перед синтезом речи☆20Updated 6 months ago
- Simple audio AE☆11Updated last week
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11Updated 5 months ago
- ☆13Updated last year
- Effective LLM Alignment Toolkit☆87Updated 3 weeks ago
- Normalize Text in Russian☆24Updated last year
- Top ML papers of the week.☆21Updated this week
- Neural model for prediction of stress position in Russian words☆11Updated last year
- Framework for processing and filtering datasets☆25Updated 3 months ago
- Training BERT for punctuation task☆10Updated 4 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆58Updated last month
- Простой IPA фонемизатор на базе ruaccent-encoder☆14Updated last month
- Using transformers to generate Russian poetry☆35Updated last year
- Russian open TTS dataset☆12Updated 5 years ago
- Нейронная сеть для восстановления пунктуации на русском языке.☆20Updated 2 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆15Updated last month
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆17Updated last year
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆30Updated 3 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆132Updated last month
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆57Updated 3 years ago
- Foundational Model for Speech Recognition Tasks☆113Updated 5 months ago
- ☆21Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM☆13Updated 9 months ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 2 years ago
- Простая модель расстановки запятых на основе BERT☆40Updated 4 years ago
- Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golos☆36Updated 2 years ago