Collect Voice Conversion researches
☆96Mar 17, 2026Updated this week
Alternatives and similar repositories for VoiceConversionLab
Users that are interested in VoiceConversionLab are comparing it to the libraries listed below
Sorting:
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Apr 13, 2022Updated 3 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Jan 15, 2024Updated 2 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆115Dec 7, 2020Updated 5 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆134Nov 29, 2023Updated 2 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆131Oct 19, 2020Updated 5 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Aug 10, 2023Updated 2 years ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆94Nov 6, 2023Updated 2 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Dec 5, 2021Updated 4 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Mar 10, 2026Updated last week
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- ☆484Oct 29, 2020Updated 5 years ago
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆203Nov 30, 2020Updated 5 years ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆116Jun 23, 2025Updated 8 months ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- ☆171Jul 25, 2022Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- ☆20Jun 5, 2022Updated 3 years ago