Adibian / Persian-MultiSpeaker-Tacotron2Links
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
☆10Updated 4 months ago
Alternatives and similar repositories for Persian-MultiSpeaker-Tacotron2
Users that are interested in Persian-MultiSpeaker-Tacotron2 are comparing it to the libraries listed below
Sorting:
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆18Updated 3 years ago
- ☆12Updated 4 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 8 months ago
- Persian Grapheme-to-Phoneme (G2P) converter☆41Updated 11 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- ☆11Updated last year
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- ☆17Updated 2 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆19Updated 2 months ago
- ☆13Updated 10 months ago
- Implementation of Emo-StarGAN☆45Updated last year
- The Vokan Architecture (Tsukasa speech based)☆10Updated 4 months ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆10Updated last month
- Persian Grapheme-to-Phoneme (G2P) converter☆20Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- An extension of PHOIBLE that includes features for allophones.☆10Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Updated 2 years ago
- ☆17Updated 4 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 8 months ago
- ☆13Updated 7 months ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- This is the experimental description of MnTTS2.☆11Updated last year
- Forced alignment decoder for Whisper.☆14Updated last year