Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆30 · May 28, 2020 · Updated 5 years ago
Alternatives and similar repositories for tacotron2
Users that are interested in tacotron2 are comparing it to the libraries listed below.
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference ☆51 · Nov 1, 2019 · Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆54 · Feb 26, 2020 · Updated 6 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app ☆28 · May 25, 2023 · Updated 2 years ago
- ☆17 · Aug 27, 2025 · Updated 7 months ago
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning ☆17 · Jan 4, 2023 · Updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis" ☆169 · Jul 6, 2023 · Updated 2 years ago
- Tacotron2 with Global Style Tokens ☆65 · Apr 19, 2019 · Updated 6 years ago
- Decision Making with Genetic Algorithms using DEAP ☆16 · Aug 17, 2016 · Updated 9 years ago
- Voice Conversion using Tacotron. ☆11 · Dec 29, 2022 · Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 · Jul 31, 2023 · Updated 2 years ago
- Various scripts that facilitate the preparation of Automatic Speech Recognition related resources ☆17 · Apr 16, 2020 · Updated 5 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 · Jun 14, 2021 · Updated 4 years ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed. ☆148 · Apr 12, 2022 · Updated 3 years ago
- follow NVIDIA, simplify it and support data parallel. ☆13 · Sep 26, 2019 · Updated 6 years ago
- Mel spectrum based on tacotron2 for melgan speech synthesis ☆15 · Mar 24, 2023 · Updated 3 years ago
- ☆12 · Jul 6, 2023 · Updated 2 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis ☆41 · Feb 20, 2022 · Updated 4 years ago
- ☆14 · Apr 2, 2023 · Updated 2 years ago
- ☆51 · Feb 15, 2019 · Updated 7 years ago
- style token with tacotron2 ☆62 · Jul 6, 2023 · Updated 2 years ago
- Theano implementation of Sequence-to-Sequence Autoencoder ☆13 · Jun 1, 2018 · Updated 7 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis" ☆366 · Dec 6, 2018 · Updated 7 years ago
- Custom TensorFlow2 implementations of forward and backward computation of soft-DTW algorithm in batch mode. ☆21 · Jun 7, 2021 · Updated 4 years ago
- Persian Grapheme-to-Phoneme (G2P) converter ☆21 · Dec 15, 2020 · Updated 5 years ago
- Cross-lingual Voice Conversion ☆97 · Feb 5, 2018 · Updated 8 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr… ☆17 · Apr 27, 2023 · Updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆160 · Jun 5, 2025 · Updated 9 months ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… ☆866 · Jul 22, 2023 · Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆55 · Oct 15, 2021 · Updated 4 years ago
- ☆16 · Sep 12, 2019 · Updated 6 years ago
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal. ☆54 · Sep 14, 2022 · Updated 3 years ago
- phonetic similarity algorithms ☆13 · Jun 19, 2018 · Updated 7 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2) ☆650 · Oct 3, 2020 · Updated 5 years ago
- MelGAN implementation with Multi-Band and Full Band supports... ☆62 · Aug 27, 2020 · Updated 5 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis ☆115 · Dec 2, 2020 · Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN model ☆123 · Jul 2, 2019 · Updated 6 years ago
- The Implementation of FastSpeech2 Based on Pytorch. ☆52 · Jul 6, 2023 · Updated 2 years ago
- Jupyter notebooks for testing concepts ☆11 · Nov 9, 2017 · Updated 8 years ago
- Tacotron2 + LPCNET for complete End-to-End TTS System ☆93 · Jul 6, 2023 · Updated 2 years ago