IS2AI / ISSAI_SAIDA_Kazakh_ASR
The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
☆52Updated 4 years ago
Alternatives and similar repositories for ISSAI_SAIDA_Kazakh_ASR
Users who are interested in ISSAI_SAIDA_Kazakh_ASR are comparing it to the libraries listed below
- ☆21Updated 6 years ago
- Experiments with grapheme2phoneme for Russian based on artificial neural networks☆20Updated 4 years ago
- Accentor and transcriptor for the Russian language☆126Updated 3 years ago
- ☆13Updated last month
- Python client for the speech recognition and synthesis API of Облако ЦРТ (the STC cloud)☆11Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆37Updated last year
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆120Updated 4 years ago
- ☆13Updated 2 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- ☆13Updated 4 years ago
- ☆22Updated 4 years ago
- Tacotron2 + Waveglow Russian☆43Updated 5 years ago
- Normalize Text in Russian☆27Updated last year
- An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has in…☆136Updated last month
- ☆37Updated 4 months ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆70Updated 2 years ago
- Baseline convolutional ASR system in PyTorch☆21Updated last year
- Grapheme To Phoneme☆73Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- The Python package russtress accentuates Russian text☆56Updated 5 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆155Updated 5 years ago
- ☆28Updated 3 months ago
- A system for multi-user transcribing speech in audio files.☆35Updated 6 months ago
- ☆56Updated 2 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- ☆38Updated 3 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆222Updated 4 years ago
- Source code for the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"☆19Updated 3 years ago