Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.
☆33Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for SingleVC
Users that are interested in SingleVC are comparing it to the libraries listed below
Sorting:
- Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features☆55Oct 11, 2021Updated 4 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Collect Voice Conversion researches☆96Updated this week
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Nov 6, 2020Updated 5 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆203Nov 30, 2020Updated 5 years ago
- Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models☆19Jul 8, 2025Updated 7 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆134Nov 29, 2023Updated 2 years ago
- A simple voice conversion tool☆20Mar 10, 2022Updated 3 years ago
- ☆22Apr 4, 2023Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆40Jan 24, 2023Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆115Dec 7, 2020Updated 5 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆59Jul 26, 2022Updated 3 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆12Feb 3, 2026Updated 3 weeks ago
- text to speech☆10Mar 19, 2024Updated last year
- A sequence-to-sequence voice conversion toolkit.☆108Jul 5, 2024Updated last year
- Toolbox for easy and qualitative one-shot voice conversion☆46Dec 5, 2021Updated 4 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion☆13May 6, 2023Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- An evaluation toolkit for voice conversion models.☆42Jul 11, 2021Updated 4 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆131Dec 8, 2023Updated 2 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Jun 21, 2018Updated 7 years ago
- ☆13Sep 1, 2023Updated 2 years ago