WelkinYang / EMPHASIS-pytorchLinks
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15Updated 6 years ago
Alternatives and similar repositories for EMPHASIS-pytorch
Users that are interested in EMPHASIS-pytorch are comparing it to the libraries listed below
Sorting:
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Google's TPGST reimplementation.☆34Updated 5 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 5 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default☆11Updated 10 years ago
- ☆15Updated 4 years ago
- TTS Text Analyzer☆32Updated 2 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- ☆34Updated 6 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 4 years ago
- ☆25Updated last year
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Updated 4 years ago
- ☆25Updated 3 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 5 years ago
- using world vocoder to extract features and make data for training neural networks☆11Updated 8 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Updated 4 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆32Updated 3 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Updated 7 years ago
- Adaptive Vocoder for Custom Voice☆61Updated 3 years ago
- Text frontend for ESPnet tts recipes☆34Updated 4 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆27Updated 2 years ago
- ☆51Updated 6 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Updated 3 years ago
- TPSE-GST Tacotron2☆14Updated 6 years ago
- ☆64Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Updated 5 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Updated 4 years ago