IS2AI / TurkicTTS
A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.
☆55 · Updated last year
Alternatives and similar repositories for TurkicTTS:
Users who are interested in TurkicTTS are comparing it to the repositories listed below.
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated last year
- A multilingual ASR model that can recognize ten Turkic languages: Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uy… ☆62 · Updated 8 months ago
- PyTorch implementation of EfficientSpeech, to be presented at ICASSP 2023. ☆162 · Updated 11 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress), a Python package for placing stress in Russian text using RNN (BiLST… ☆33 · Updated 7 months ago
- Simplified diarization pipeline using some pretrained models: audio file to diarized segments in a few lines of code. ☆145 · Updated 10 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆30 · Updated last year
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer". ☆22 · Updated 2 years ago
- Uyghur Single Speaker Speech Dataset. ☆24 · Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer. ☆47 · Updated 8 months ago
- Linguistic processing for Common Voice. ☆53 · Updated last year
- Various speech datasets made available to the public. ☆113 · Updated 2 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆69 · Updated 4 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition. ☆39 · Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization. ☆49 · Updated 9 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR. ☆50 · Updated 3 weeks ago
- The VoxTube dataset official repository. ☆68 · Updated last year
- This is the M-AILABS Speech Dataset. ☆44 · Updated 3 months ago
- Whisper fine-tuning event script adapted to use multiple Hugging Face datasets. ☆32 · Updated 2 years ago
- Collection of scripts from mHuBERT-147. ☆24 · Updated 3 months ago
- Final training script from the Hugging Face Whisper fine-tuning event, intended to get the best results from the fine-tuned model. ☆12 · Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA. ☆61 · Updated last week
- Python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones. ☆33 · Updated last year
- asr2k ☆49 · Updated 9 months ago
- Prosodic Speech Segmentation with Transformers. ☆25 · Updated last year