guanlongzhao / fac-via-ppgView external linksLinks
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆148Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for fac-via-ppg
Users that are interested in fac-via-ppg are comparing it to the libraries listed below
Sorting:
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Jul 13, 2020Updated 5 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆340Jul 6, 2023Updated 2 years ago
- ☆15May 8, 2021Updated 4 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆134Nov 29, 2023Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Mar 24, 2023Updated 2 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆150Feb 11, 2023Updated 3 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆698Oct 23, 2024Updated last year
- ☆88Nov 1, 2022Updated 3 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- ☆11May 7, 2022Updated 3 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 4 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago
- ☆197May 3, 2024Updated last year
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Apr 29, 2022Updated 3 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- ☆90Sep 24, 2021Updated 4 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Jan 15, 2024Updated 2 years ago
- ☆259May 15, 2023Updated 2 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- ☆100Jul 22, 2021Updated 4 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Mar 3, 2022Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆203Nov 30, 2020Updated 5 years ago
- ☆37May 8, 2021Updated 4 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,091Oct 23, 2024Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago