ivanvovk / durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆183Updated 4 years ago
Alternatives and similar repositories for durian-pytorch:
Users who are interested in durian-pytorch compare it to the libraries listed below.
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆145Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆111Updated 4 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated last year
- PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"☆115Updated 3 years ago
- Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.☆154Updated 3 years ago
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS☆162Updated 9 months ago
- An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆124Updated 4 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆243Updated 2 years ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- VAE Tacotron 2, an alternative to GST Tacotron☆88Updated last year
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆201Updated 4 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆115Updated 3 years ago
- PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation☆191Updated 2 years ago
- PPG-Based Voice Conversion☆332Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆168Updated last year
- Efficient neural speech synthesis☆80Updated 4 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆113Updated 4 years ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆86Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆72Updated 5 years ago
- Official implementation of SpeechSplit2☆130Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆343Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆114Updated 11 months ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆140Updated 2 years ago
- A PyTorch implementation of FB-MelGAN☆88Updated 4 years ago