π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
β1,161May 3, 2024Updated last year
Alternatives and similar repositories for TransformerTTS
Users that are interested in TransformerTTS are comparing it to the libraries listed below
Sorting:
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β690Nov 8, 2023Updated 2 years ago
- β© Generating speech in a single forward pass without any attention!β581Mar 15, 2026Updated last week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,995Jul 5, 2024Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ704Jul 12, 2022Updated 3 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β650Oct 3, 2020Updated 5 years ago
- The Implementation of FastSpeech based on pytorch.β880Jul 6, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β866Jul 22, 2023Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,162Oct 27, 2023Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β844Oct 10, 2023Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,637Apr 22, 2024Updated last year
- WaveRNN Vocoder + TTSβ2,179Jul 2, 2022Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,332Jul 27, 2024Updated last year
- β262Dec 8, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020β266Mar 29, 2022Updated 3 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β900Jul 6, 2023Updated 2 years ago
- List of speech synthesis papers.β1,068Jul 24, 2023Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Networkβ321Jul 25, 2024Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β328Sep 24, 2022Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.β184Aug 12, 2020Updated 5 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β360Mar 25, 2023Updated 2 years ago
- πΈ collection of TTS papersβ723Jul 4, 2024Updated last year
- Unsupervised Speech Decomposition Via Triple Information Bottleneckβ699Oct 23, 2024Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ233Jun 22, 2022Updated 3 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Datasetβ361Dec 24, 2021Updated 4 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- End-to-End Speech Processing Toolkitβ9,780Updated this week
- Official code for Cotatron @ INTERSPEECH 2020β214Jul 25, 2024Updated last year
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generationβ197Feb 10, 2022Updated 4 years ago
- Simple text to phones converter for multiple languagesβ1,520Sep 26, 2024Updated last year
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.β408Jul 7, 2021Updated 4 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!β360Apr 27, 2022Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.β144Jul 8, 2021Updated 4 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.β602Sep 18, 2023Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ915Jan 5, 2023Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ1,037Aug 28, 2023Updated 2 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"β116Dec 22, 2021Updated 4 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeechβ252Feb 9, 2022Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.β157Jul 2, 2021Updated 4 years ago