rhoposit / multilingual_VQVAEView external linksLinks
☆37May 8, 2021Updated 4 years ago
Alternatives and similar repositories for multilingual_VQVAE
Users that are interested in multilingual_VQVAE are comparing it to the libraries listed below
Sorting:
- ☆15May 8, 2021Updated 4 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 4 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆167Apr 10, 2024Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- ☆11May 9, 2023Updated 2 years ago
- ☆10Apr 8, 2024Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆53Jul 6, 2023Updated 2 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆340Jul 6, 2023Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago