cnlinxi / style-token_tacotron2
style token with tacotron2
☆61Updated last year
Alternatives and similar repositories for style-token_tacotron2:
Users that are interested in style-token_tacotron2 are comparing it to the libraries listed below
- ☆51Updated 6 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2☆81Updated 4 years ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 7 years ago
- A pytroch implementation of the FB-MelGAN☆89Updated 4 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 6 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …☆114Updated 4 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆25Updated 4 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆168Updated last year
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Implementation of the AlignTTS☆76Updated last year
- ☆45Updated 5 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆102Updated 3 years ago
- Chinese Text Normalization and Dataset☆82Updated 2 years ago
- ☆69Updated 4 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Efficient neural speech synthesis☆80Updated 4 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Updated 7 months ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆72Updated 5 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- ☆34Updated 5 years ago
- ☆74Updated 2 years ago