spring-media / TransformerTTSLinks
π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
β1,150Updated last year
Alternatives and similar repositories for TransformerTTS
Users that are interested in TransformerTTS are comparing it to the libraries listed below
Sorting:
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- β© Generating speech in a single forward pass without any attention!β579Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ695Updated 3 years ago
- The Implementation of FastSpeech based on pytorch.β873Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,611Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β861Updated 2 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β897Updated 2 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β679Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,349Updated last year
- πΈ collection of TTS papersβ706Updated last year
- WaveRNN Vocoder + TTSβ2,164Updated 3 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β987Updated last month
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ1,015Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,065Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,195Updated last year
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β645Updated 4 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Lossβ1,071Updated 9 months ago
- A Flow-based Generative Network for Speech Synthesisβ2,332Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ475Updated 5 years ago
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new β¦β1,303Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretationβ545Updated 2 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"β436Updated 4 years ago
- DeepMind's Tacotron-2 Tensorflow implementationβ2,309Updated 2 years ago
- List of speech synthesis papers.β1,052Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ866Updated 2 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,007Updated 8 months ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β361Updated 2 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,980Updated last year
- Unsupervised Speech Decomposition Via Triple Information Bottleneckβ691Updated 9 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,954Updated last year