π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
β1,162May 3, 2024Updated last year
Alternatives and similar repositories for TransformerTTS
Users that are interested in TransformerTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β689Nov 8, 2023Updated 2 years ago
- β© Generating speech in a single forward pass without any attention!β581Mar 15, 2026Updated 2 weeks ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,994Jul 5, 2024Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ704Jul 12, 2022Updated 3 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β650Oct 3, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Implementation of FastSpeech based on pytorch.β880Jul 6, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β866Jul 22, 2023Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,162Oct 27, 2023Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β845Oct 10, 2023Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,637Apr 22, 2024Updated last year
- WaveRNN Vocoder + TTSβ2,179Jul 2, 2022Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,336Jul 27, 2024Updated last year
- β262Dec 8, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- VCTK multi-speaker tacotron for ICASSP 2020β266Mar 29, 2022Updated 4 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β900Jul 6, 2023Updated 2 years ago
- List of speech synthesis papers.β1,069Jul 24, 2023Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Networkβ321Jul 25, 2024Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β328Sep 24, 2022Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.β184Aug 12, 2020Updated 5 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β360Mar 25, 2023Updated 3 years ago
- πΈ collection of TTS papersβ723Jul 4, 2024Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ233Jun 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Unsupervised Speech Decomposition Via Triple Information Bottleneckβ699Oct 23, 2024Updated last year
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Datasetβ362Dec 24, 2021Updated 4 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- End-to-End Speech Processing Toolkitβ9,788Updated this week
- Official code for Cotatron @ INTERSPEECH 2020β214Jul 25, 2024Updated last year
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generationβ197Feb 10, 2022Updated 4 years ago
- Simple text to phones converter for multiple languagesβ1,524Sep 26, 2024Updated last year
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.β408Jul 7, 2021Updated 4 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!β360Apr 27, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.β144Jul 8, 2021Updated 4 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.β602Sep 18, 2023Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ915Jan 5, 2023Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ1,037Aug 28, 2023Updated 2 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"β116Dec 22, 2021Updated 4 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech