cyhuang-tw / AutoVC
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆33Updated 3 years ago
Alternatives and similar repositories for AutoVC:
Users that are interested in AutoVC are comparing it to the libraries listed below
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Collect Voice Conversion researches☆91Updated this week
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆115Updated 3 years ago
- ☆97Updated 3 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆113Updated 4 years ago
- ☆91Updated 3 years ago
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- ☆28Updated 4 years ago
- Implementation of the AlignTTS☆76Updated last year
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆57Updated 2 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆69Updated 2 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Updated 4 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆124Updated 4 years ago
- Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.☆23Updated 4 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 6 months ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆67Updated 3 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆130Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆81Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago