Emotional-Text-to-Speech / hmm-for-emo-tts
A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech from text
☆48Updated 2 years ago
Alternatives and similar repositories for hmm-for-emo-tts:
Users that are interested in hmm-for-emo-tts are comparing it to the libraries listed below
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆158Updated 3 weeks ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- ☆75Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch☆54Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆215Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆132Updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆167Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Official code for Wav2Seq☆96Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Updated 3 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆192Updated 3 years ago
- ☆163Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- multilingual speech aligner☆74Updated last year
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆81Updated 2 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- ☆34Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆209Updated last month
- Code for AccentDB.☆20Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago