Gaussian Mixture VAE Tacotron
☆53Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for gmvae_tacotron
Users that are interested in gmvae_tacotron are comparing it to the libraries listed below
Sorting:
- ☆51Feb 15, 2019Updated 7 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Oct 23, 2019Updated 6 years ago
- WaveGlow vocoder with VQVAE☆61Jun 18, 2019Updated 6 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆374Dec 8, 2022Updated 3 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆232Dec 27, 2019Updated 6 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Dec 6, 2018Updated 7 years ago
- ☆37May 8, 2021Updated 4 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- ☆64Aug 14, 2023Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆54Sep 14, 2022Updated 3 years ago
- Tensorflow implementation of Nvidia Waveglow☆41Dec 5, 2018Updated 7 years ago
- Authors' implementation of DeepSpeech Distances.☆130May 5, 2020Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio" (ICML 2020)☆128Jul 25, 2024Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago