Global Rhythm Style Transfer Without Text Transcriptions
☆284 · Oct 23, 2024 · Updated last year
Alternatives and similar repositories for AutoPST
Users who are interested in AutoPST are comparing it to the repositories listed below.
- Unsupervised Speech Decomposition Via Triple Information Bottleneck ☆699 · Oct 23, 2024 · Updated last year
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss ☆1,094 · Oct 23, 2024 · Updated last year
- speech self-supervised representations ☆517 · Apr 27, 2023 · Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆360 · Apr 27, 2022 · Updated 3 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada… ☆24 · Sep 27, 2022 · Updated 3 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆134 · Nov 29, 2023 · Updated 2 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm… ☆117 · May 27, 2021 · Updated 4 years ago
- Official implementation of SpeechSplit2 ☆135 · Oct 22, 2022 · Updated 3 years ago
- Demo for 2022 Interspeech ☆29 · Jun 14, 2022 · Updated 3 years ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models ☆42 · Nov 18, 2025 · Updated 4 months ago
- PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice. ☆330 · Feb 9, 2024 · Updated 2 years ago
- Demo for 2022 ICASSP ☆64 · Jun 14, 2022 · Updated 3 years ago
- PPG-Based Voice Conversion ☆348 · Jul 22, 2022 · Updated 3 years ago
- Collect Voice Conversion researches ☆96 · Updated this week
- ☆82 · Jan 22, 2025 · Updated last year
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis. ☆73 · Aug 3, 2021 · Updated 4 years ago
- Deep learning based Speech Beamforming ☆64 · Mar 29, 2018 · Updated 7 years ago
- ☆129 · Apr 2, 2023 · Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder ☆171 · Jul 25, 2024 · Updated last year
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model. ☆144 · Jul 8, 2021 · Updated 4 years ago
- ☆100 · Jul 22, 2021 · Updated 4 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow. ☆27 · Oct 30, 2018 · Updated 7 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion ☆143 · Sep 1, 2020 · Updated 5 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion ☆339 · Jul 6, 2023 · Updated 2 years ago
- Official Implementation of StyleTTS-VC ☆198 · Jan 14, 2025 · Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion ☆86 · Aug 3, 2023 · Updated 2 years ago
- Official Code for Assem-VC @ICASSP2022 ☆269 · May 16, 2022 · Updated 3 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S… ☆415 · Aug 29, 2023 · Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder ☆270 · Jul 29, 2023 · Updated 2 years ago
- A curated list of awesome voice conversion, projects and communities. ☆262 · Nov 18, 2025 · Updated 4 months ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion ☆259 · Jul 13, 2023 · Updated 2 years ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech ☆237 · Feb 29, 2024 · Updated 2 years ago
- Voice conversion training with 109 speakers with limited training samples ☆35 · Dec 21, 2020 · Updated 5 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech ☆252 · Feb 9, 2022 · Updated 4 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆119 · Feb 7, 2024 · Updated 2 years ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ☆707 · Jan 19, 2025 · Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆148 · Jan 15, 2024 · Updated 2 years ago
- Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder. ☆23 · Jan 24, 2021 · Updated 5 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch) ☆59 · Jul 26, 2022 · Updated 3 years ago