keonlee9420/Deep-Learning-TTS-Template

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/keonlee9420/Deep-Learning-TTS-Template)

keonlee9420 / Deep-Learning-TTS-Template

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

☆14

Alternatives and similar repositories for Deep-Learning-TTS-Template

Users that are interested in Deep-Learning-TTS-Template are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

li1jkdaw / LPCNet_parallel
View on GitHub
Simulation of parallel synthesis with LPCNet vocoder
☆14May 5, 2020Updated 6 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
NingMiao / InteL-VAEs
View on GitHub
Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.
☆18Jun 25, 2021Updated 5 years ago
naderman / PyrusBundle
View on GitHub
Integrates Pyrus, the management tool for PEAR2 packages, into Symfony
☆15Aug 28, 2013Updated 12 years ago
maum-ai / maum-ai.github.io
View on GitHub
maum-ai.github.io
☆15Jun 12, 2026Updated last month
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
phrasenmaeher / audio-transformation-visualization
View on GitHub
A streamlit application that lets you explore the effect of different audio augmentation techniques
☆28Sep 18, 2022Updated 3 years ago
auspicious3000 / SpeechSplit-Demo
View on GitHub
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14Apr 29, 2020Updated 6 years ago
IIP-Sogang / olkavs-avspeech
View on GitHub
The Introduction of the OLKAVS Dataset
☆39May 28, 2024Updated 2 years ago
SoonbeomChoi / BEGANSing
View on GitHub
Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN
☆67Apr 26, 2021Updated 5 years ago
rishikksh20 / AdaSpeech
View on GitHub
AdaSpeech: Adaptive Text to Speech for Custom Voice
☆162Aug 31, 2021Updated 4 years ago
Sytronik / deep-griffinlim-iteration
View on GitHub
PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)
☆39Oct 12, 2019Updated 6 years ago
kaistmm / AdaptVC
View on GitHub
☆17Jun 2, 2025Updated last year
huawei-noah / Speech-Backbones
View on GitHub
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604Sep 18, 2023Updated 2 years ago
snu-mllab / DisentanglementICML19
View on GitHub
"Learning Discrete and Continuous Factors of Data via Alternating Disentanglement" accepted at ICML2019
☆22Aug 22, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
lwang114 / UnsupTTS
View on GitHub
☆37Mar 26, 2024Updated 2 years ago
Deepest-Project / meta-learning-study
View on GitHub
Deepest Season 6 Meta-Learning study papers plus alpha
☆25Mar 4, 2020Updated 6 years ago
Tomiinek / Blizzard2013_Segmentation
View on GitHub
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆45Nov 13, 2019Updated 6 years ago
tzutalin / minizip
View on GitHub
Minizip for Unix/Linux and mobile devices
☆10Aug 31, 2017Updated 8 years ago
wangxiongts / vllm
View on GitHub
☆18Jan 26, 2026Updated 5 months ago
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
fastai / cards_deck
View on GitHub
A minimal example of nbdev based on Allen Downey's Think Python 2nd Ed
☆11Jul 29, 2022Updated 3 years ago
thuhcsi / tacotron
View on GitHub
PyTorch implementation of Tacotron and Tacotron2
☆34Jul 19, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
francislata / unicats
View on GitHub
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆26Nov 4, 2023Updated 2 years ago
ysbsb / awesome-quantization
View on GitHub
Awesome Quantization Paper lists with Codes
☆10Feb 24, 2021Updated 5 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
kpaul073 / AGI_HER_SV
View on GitHub
Flow matching based speaker verification
☆24Dec 20, 2025Updated 7 months ago
justinwlin / runpodWhisperx
View on GitHub
Runpod WhisperX Docker Container Repo
☆16Mar 10, 2024Updated 2 years ago
One-Shot-Voice-Conversion-with-WIN / WINVC
View on GitHub
Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".
☆30Nov 13, 2021Updated 4 years ago
Chien-Hung / Speech-Emotion-Recognition
View on GitHub
3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.
☆44Nov 13, 2020Updated 5 years ago
CleverTap / Analytics_ds_articles
View on GitHub
☆13Aug 4, 2016Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jgarciapueyo / MelNet-SpeechGeneration
View on GitHub
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
☆25Sep 16, 2020Updated 5 years ago
seongq / AGI_HER_SE
View on GitHub
☆24Dec 19, 2025Updated 7 months ago
bshall / Tacotron
View on GitHub
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
☆115Dec 2, 2020Updated 5 years ago
SJTMusicTeam / MusicGeneration
View on GitHub
☆10May 15, 2021Updated 5 years ago
seongq / AGI_HER_MER
View on GitHub
☆29Dec 19, 2025Updated 7 months ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
keonlee9420 / VAENAR-TTS
View on GitHub
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
☆74Aug 3, 2021Updated 4 years ago