spring-media / ForwardTacotronLinks
β© Generating speech in a single forward pass without any attention!
β579Updated 10 months ago
Alternatives and similar repositories for ForwardTacotron
Users that are interested in ForwardTacotron are comparing it to the libraries listed below
Sorting:
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,147Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β861Updated last year
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β898Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ690Updated 2 years ago
- The Implementation of FastSpeech based on pytorch.β871Updated last year
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β645Updated 4 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ472Updated 5 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β836Updated last year
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β678Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020β266Updated 3 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"β436Updated 4 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretationβ538Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,606Updated last year
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"β252Updated 5 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)β516Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"β367Updated 6 years ago
- Text to Speech with PyTorch (English and Mongolian)β184Updated 8 months ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β368Updated last week
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.β498Updated last year
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglowβ128Updated 4 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesisβ367Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ853Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β360Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.β403Updated 3 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baiduβ254Updated 4 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speechβ450Updated 11 months ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"β211Updated 10 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ230Updated 2 years ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"β239Updated 4 years ago