kaituoxu / Tacotron2View external linksLinks
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
☆52Jan 30, 2019Updated 7 years ago
Alternatives and similar repositories for Tacotron2
Users that are interested in Tacotron2 are comparing it to the libraries listed below
Sorting:
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- FFTNet vocoder implementation☆81Sep 28, 2018Updated 7 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- A implementation voice morphing using relgan with tensorflow☆25Mar 24, 2023Updated 2 years ago
- WaveGlow vocoder with VQVAE☆61Jun 18, 2019Updated 6 years ago
- Interface for running Praat scripts through Python☆17May 16, 2025Updated 9 months ago
- Mutiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆119Mar 18, 2023Updated 2 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- how to generate the full-contextual labels from un-seen text for the application of HMM-based speech synthesis (HTS)☆12Nov 22, 2019Updated 6 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- Score- and Lyrics-Free Singing Voice Generation☆28May 25, 2020Updated 5 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- An implementation of GlowTTS designed to work with Gruut☆12Mar 9, 2022Updated 3 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 4 years ago
- Autoencoder Based Real-Time Timbre Interpolation Algorithm☆12Aug 17, 2020Updated 5 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Apr 30, 2022Updated 3 years ago
- An implementation of Tacotron and Tacotron2☆80Aug 4, 2021Updated 4 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Apr 9, 2019Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 7 years ago
- Implementation of DCTTS with Adversarial Training☆12Dec 30, 2019Updated 6 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation☆46Oct 29, 2022Updated 3 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Dec 2, 2020Updated 5 years ago
- Voice conversion tools for STRAIGHT☆29Jul 17, 2020Updated 5 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- Asteroid's filterbanks☆88Jan 12, 2025Updated last year
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- ☆12Nov 5, 2019Updated 6 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆14Nov 27, 2019Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Mar 29, 2019Updated 6 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Mar 14, 2019Updated 6 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- End-2-end speech synthesis with recurrent neural networks☆223Feb 24, 2024Updated last year