IS2AI / ISSAI_SAIDA_Kazakh_ASR
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
☆50Updated 3 years ago
Alternatives and similar repositories for ISSAI_SAIDA_Kazakh_ASR:
Users that are interested in ISSAI_SAIDA_Kazakh_ASR are comparing it to the libraries listed below
- ☆20Updated 5 years ago
- ☆11Updated 2 years ago
- Accentor and transcriptor for Russian language☆123Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆119Updated 4 years ago
- Experiments with grapheme2phoneme for Russian based on the artificial neural networks☆20Updated 3 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 3 years ago
- ☆11Updated 3 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆33Updated 7 months ago
- ☆34Updated this week
- Normalize Text in Russian☆26Updated last year
- ☆22Updated 3 years ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆58Updated 3 years ago
- Speech analytics package for call-center☆23Updated 4 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Updated 2 years ago
- An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has in…☆126Updated 2 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- G2P tool for Russian language with vosk-model-ru styled transcriptions☆9Updated 3 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Updated 2 years ago
- Tacotron2 + Waveglow Russian☆43Updated 5 years ago
- ☆16Updated this week
- ☆56Updated 2 years ago
- python package russtress accentuates russian text☆52Updated 4 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆55Updated last year
- ☆13Updated 2 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 11 months ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Updated 3 years ago