Adibian / Persian-MultiSpeaker-Tacotron2Links
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
☆12Updated last month
Alternatives and similar repositories for Persian-MultiSpeaker-Tacotron2
Users that are interested in Persian-MultiSpeaker-Tacotron2 are comparing it to the libraries listed below
Sorting:
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆20Updated 3 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆20Updated 4 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆41Updated last year
- Sharif Emotional Speech Database☆38Updated 4 years ago
- A Grapheme to Phoneme model using LSTM implemented in pytorch☆13Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Persian Grapheme To Phoneme with Transformer in Pytorch☆11Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
- A large-scale validated database for Persian speech emotion detection.☆24Updated 3 years ago
- ☆14Updated last year
- ☆11Updated 2 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆13Updated 6 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated 2 months ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Updated 5 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- Bert-Based persian spell-checker☆19Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆12Updated 8 months ago
- A Text-To-Speech Model Developed Using 🐸STT☆12Updated 3 years ago
- The Vokan Architecture (Tsukasa speech based)☆10Updated 9 months ago
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated last year
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Updated 5 years ago
- ☆17Updated 4 years ago
- fine-tune Wav2vec2. an ASR model released by Facebook☆38Updated 3 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- My guide to create an italian TTS with Coqui☆14Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆11Updated last year