IEEE-NITK / Neural-Voice-Cloning
Neural voice cloning from a few voice samples, using the speaker adaptation method. Speaker adaptation fine-tunes a multi-speaker generative model on a few cloning samples.
☆57 Updated 5 years ago
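The fine-tuning idea in the description can be illustrated with a toy sketch. This is not the repository's code: the scalar "model", the names `predict` and `adapt_speaker`, and all numbers are assumptions made up for illustration. A shared weight stands in for the pretrained multi-speaker model, a per-speaker offset stands in for the speaker embedding, and adaptation is plain gradient descent on a handful of cloning samples.

```python
def predict(weight, embedding, x):
    """Toy stand-in for a multi-speaker model: shared weight plus a speaker offset."""
    return weight * x + embedding


def adapt_speaker(weight, embedding, samples, lr=0.1, steps=200, tune_weight=False):
    """Fine-tune on a few (x, y) cloning samples by gradient descent on the mean squared error.

    With tune_weight=False only the speaker embedding is updated;
    with tune_weight=True the shared weight is fine-tuned as well.
    """
    n = len(samples)
    for _ in range(steps):
        grad_e = 0.0
        grad_w = 0.0
        for x, y in samples:
            err = predict(weight, embedding, x) - y
            grad_e += 2 * err / n
            grad_w += 2 * err * x / n
        embedding -= lr * grad_e
        if tune_weight:
            weight -= lr * grad_w
    return weight, embedding


# Hypothetical "pretrained" state: one shared weight, embeddings for known speakers.
shared_weight = 2.0
known_embeddings = {"spk_a": -1.0, "spk_b": 0.5}

# A few cloning samples from an unseen speaker whose true offset is 3.0.
true_embedding = 3.0
cloning_samples = [(x, shared_weight * x + true_embedding) for x in (0.1, 0.5, 0.9)]

# Start from the average known embedding, then adapt the embedding only.
init_embedding = sum(known_embeddings.values()) / len(known_embeddings)
adapted_weight, adapted_embedding = adapt_speaker(
    shared_weight, init_embedding, cloning_samples
)
print(round(adapted_embedding, 4))
```

In this sketch, keeping `tune_weight=False` leaves the shared weight untouched and adapts only the new speaker's offset, while `tune_weight=True` also updates the shared parameter. A real system would do the same with a neural network and audio features rather than scalars.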
Related projects:
- ☆45 Updated this week
- A collection of voice conversion research ☆90 Updated this week
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆36 Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆79 Updated last year
- Voice Conversion Challenge 2020 CycleVAE baseline system ☆133 Updated 3 years ago
- ☆35 Updated this week
- This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-… ☆120 Updated 3 years ago
- ☆28 Updated 4 years ago
- Interface for Controllable Expressive Talking Machine ☆37 Updated 8 months ago
- VAE Tacotron 2, an alternative to GST Tacotron ☆85 Updated last year
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆39 Updated last year
- Official PyTorch implementation of Speaker Conditional WaveRNN ☆109 Updated 2 years ago
- A PyTorch implementation of StarGAN-VC2 ☆146 Updated 4 years ago
- Implementation of GAN architectures for Voice Conversion ☆51 Updated 5 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆84 Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No… ☆110 Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021 ☆39 Updated 3 years ago
- Singing Style Transfer using Deep U-net for vocal separation & Cycle-Consistency Boundary Equilibrium GAN (Cycle-BEGAN) for vocal style trans… ☆35 Updated 5 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in TensorFlow. ☆25 Updated 5 years ago
- Deep Voice 3 + WORLD vocoder. ☆17 Updated 4 years ago
- Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2 ☆81 Updated 3 years ago
- Implementation of multi-speaker TTS ☆49 Updated 3 years ago
- Demo page: https://MingjieChen.github.io/dygan-vc ☆67 Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder ☆168 Updated last month
- AdaSpeech: Adaptive Text to Speech for Custom Voice ☆156 Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆128 Updated 3 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder ☆52 Updated 4 years ago
- Deep Convolution Text to Speech ☆35 Updated 6 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch) ☆58 Updated 2 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21) ☆138 Updated 2 years ago
- ☆90 Updated 2 years ago