Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
Alternatives and similar repositories for Phone-Level-Mixture-Density-Network-for-TTS
Users that are interested in Phone-Level-Mixture-Density-Network-for-TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Official implementation of SpeechSplit2☆135Oct 22, 2022Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Jun 5, 2025Updated 9 months ago
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis☆73Aug 3, 2021Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆169Apr 10, 2024Updated last year
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ☆64Jan 15, 2024Updated 2 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆69Mar 31, 2021Updated 4 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆147Jun 6, 2022Updated 3 years ago
- ☆88Nov 1, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆45Mar 2, 2021Updated 5 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆18Jun 25, 2021Updated 4 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆26Nov 4, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆98Jun 7, 2022Updated 3 years ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆56Nov 16, 2025Updated 4 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Apr 29, 2022Updated 3 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆129Apr 8, 2023Updated 2 years ago
- ☆111Mar 9, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆26Sep 22, 2022Updated 3 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Jan 24, 2023Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample☆100Jul 26, 2022Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago