Emotional-Text-to-Speech / dl-for-emo-ttsView external linksLinks
A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech
☆458Jun 26, 2024Updated last year
Alternatives and similar repositories for dl-for-emo-tts
Users that are interested in dl-for-emo-tts are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆316Aug 25, 2021Updated 4 years ago
- This is the GitHub page for publicly available emotional speech data.☆381Jan 6, 2022Updated 4 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017☆72Aug 22, 2019Updated 6 years ago
- ☆121Oct 24, 2022Updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Jun 5, 2025Updated 8 months ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.☆600Sep 18, 2023Updated 2 years ago
- List of speech synthesis papers.☆1,065Jul 24, 2023Updated 2 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆283Oct 10, 2023Updated 2 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Nov 9, 2022Updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech from text☆50Dec 30, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.☆329Feb 9, 2024Updated 2 years ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆701Jul 12, 2022Updated 3 years ago
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆167Apr 10, 2024Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Jun 6, 2022Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs☆347Feb 21, 2022Updated 3 years ago
- Official Implementation of StyleTTS☆461Jan 13, 2025Updated last year
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- The Implementation of FastSpeech based on pytorch.☆880Jul 6, 2023Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Mar 6, 2024Updated last year