This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
☆14 · Jun 15, 2021 · Updated 4 years ago
Alternatives and similar repositories for Deep-Learning-TTS-Template
Users that are interested in Deep-Learning-TTS-Template are comparing it to the libraries listed below.
- Simulation of parallel synthesis with LPCNet vocoder ☆14 · May 5, 2020 · Updated 5 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis ☆41 · Feb 20, 2022 · Updated 4 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents>. ☆18 · Jun 25, 2021 · Updated 4 years ago
- ☆30 · Dec 19, 2025 · Updated 3 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆12 · Sep 29, 2025 · Updated 5 months ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques ☆28 · Sep 18, 2022 · Updated 3 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck ☆14 · Apr 29, 2020 · Updated 5 years ago
- ☆18 · Jun 14, 2025 · Updated 9 months ago
- The Introduction of the OLKAVS Dataset ☆37 · May 28, 2024 · Updated last year
- Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN ☆67 · Apr 26, 2021 · Updated 4 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice ☆162 · Aug 31, 2021 · Updated 4 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab. ☆602 · Sep 18, 2023 · Updated 2 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971) ☆39 · Oct 12, 2019 · Updated 6 years ago
- "Learning Discrete and Continuous Factors of Data via Alternating Disentanglement" accepted at ICML2019 ☆22 · Aug 22, 2019 · Updated 6 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset. ☆45 · Nov 13, 2019 · Updated 6 years ago
- ESPnet-TTS Audio Sample HP ☆21 · Oct 25, 2019 · Updated 6 years ago
- Runpod WhisperX Docker Container Repo ☆15 · Mar 10, 2024 · Updated 2 years ago
- ☆37 · Mar 26, 2024 · Updated 2 years ago
- A minimal example of nbdev based on Allen Downey's Think Python 2nd Ed ☆10 · Jul 29, 2022 · Updated 3 years ago
- Deepest Season 6 Meta-Learning study papers plus alpha ☆25 · Mar 4, 2020 · Updated 6 years ago
- Minizip for Unix/Linux and mobile devices ☆10 · Aug 31, 2017 · Updated 8 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding". ☆26 · Nov 4, 2023 · Updated 2 years ago
- ☆31 · Jul 18, 2024 · Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · May 30, 2023 · Updated 2 years ago
- PyTorch implementation of Tacotron and Tacotron2 ☆34 · Jul 19, 2022 · Updated 3 years ago
- The bare metal in my basement ☆21 · Dec 4, 2025 · Updated 3 months ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition. ☆45 · Nov 13, 2020 · Updated 5 years ago
- Awesome Quantization Paper lists with Codes ☆10 · Feb 24, 2021 · Updated 5 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow. ☆22 · Dec 26, 2019 · Updated 6 years ago
- Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization". ☆30 · Nov 13, 2021 · Updated 4 years ago
- ☆14 · Aug 4, 2016 · Updated 9 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples ☆25 · Sep 16, 2020 · Updated 5 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis ☆115 · Dec 2, 2020 · Updated 5 years ago
- ☆69 · Mar 31, 2021 · Updated 4 years ago
- ☆20 · Apr 18, 2024 · Updated last year
- ☆10 · May 15, 2021 · Updated 4 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification ☆14 · Jul 19, 2022 · Updated 3 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis. ☆73 · Aug 3, 2021 · Updated 4 years ago
- ICASSP 2023 Accepted ☆190 · May 6, 2024 · Updated last year