☆21Jun 16, 2021Updated 4 years ago
Alternatives and similar repositories for EmotionControllableTextToSpeech
Users that are interested in EmotionControllableTextToSpeech are comparing it to the libraries listed below
Sorting:
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Nov 4, 2022Updated 3 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 6 years ago
- Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017☆72Aug 22, 2019Updated 6 years ago
- ☆37May 8, 2021Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- ☆121Oct 24, 2022Updated 3 years ago
- ☆11May 9, 2023Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Nov 9, 2022Updated 3 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆46Nov 3, 2021Updated 4 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆318Aug 25, 2021Updated 4 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆168Apr 10, 2024Updated last year
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆129Apr 8, 2023Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- ☆31Nov 7, 2018Updated 7 years ago
- Speech samples and code of BEdit-TTS☆34Oct 8, 2023Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆16Mar 25, 2025Updated 11 months ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Apr 9, 2021Updated 4 years ago
- his code is a pytorch version for CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"☆15Jan 14, 2022Updated 4 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- ☆16Dec 23, 2021Updated 4 years ago