Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
☆203Nov 30, 2020Updated 5 years ago
Alternatives and similar repositories for FragmentVC
Users that are interested in FragmentVC are comparing it to the libraries listed below
Sorting:
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- ☆100Jul 22, 2021Updated 4 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- An implementation of SkipVQVC with various settings.☆75Jun 22, 2020Updated 5 years ago
- ☆484Oct 29, 2020Updated 5 years ago
- An evaluation toolkit for voice conversion models.☆42Jul 11, 2021Updated 4 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆115Dec 7, 2020Updated 5 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- ☆22Aug 21, 2020Updated 5 years ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Jan 8, 2024Updated 2 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆339Jul 6, 2023Updated 2 years ago
- Collect Voice Conversion researches☆96Updated this week
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,094Oct 23, 2024Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆699Oct 23, 2024Updated last year
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- ☆90Sep 24, 2021Updated 4 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Apr 26, 2021Updated 4 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Jun 26, 2024Updated last year
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆59Jul 26, 2022Updated 3 years ago
- ☆171Jul 25, 2022Updated 3 years ago
- ☆30Jun 30, 2020Updated 5 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- ☆42Mar 25, 2022Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago