saarus72 / text_normalization
T5-based (Russian) text normalization
☆20Updated last year
Alternatives and similar repositories for text_normalization:
Users that are interested in text_normalization are comparing it to the libraries listed below
- Simple audio AE☆12Updated 3 months ago
- ☆13Updated 3 years ago
- A simple text normalizer for use before speech synthesis☆25Updated 9 months ago
- Normalize Text in Russian☆26Updated last year
- A simple IPA phonemizer based on ruaccent-encoder☆17Updated 4 months ago
- ☆13Updated 2 years ago
- Framework for processing and filtering datasets☆27Updated 6 months ago
- Russian open TTS dataset☆12Updated 5 years ago
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- Training BERT for punctuation task☆10Updated 4 years ago
- ☆37Updated 3 weeks ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 3 years ago
- A neural network for restoring punctuation in Russian.☆20Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 4 months ago
- A list of initiatives for adding new languages to opensource machine translation models☆17Updated 3 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆38Updated 2 months ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆58Updated 3 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆31Updated 6 months ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆13Updated last year
- Neural model for prediction of stress position in Russian words☆11Updated last year
- ☆43Updated last week
- Augmentex — a library for augmenting texts with errors☆61Updated 7 months ago
- A repository for trainable multi-speaker TTS☆14Updated 3 years ago
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11Updated 8 months ago
- ☆39Updated 8 months ago
- Effective LLM Alignment Toolkit☆115Updated this week
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated last week
- Using transformers to generate Russian poetry☆35Updated last year
- A simple stress placement tool with homograph handling☆109Updated 3 months ago