π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
β1,163May 3, 2024Updated last year
Alternatives and similar repositories for TransformerTTS
Users that are interested in TransformerTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β689Nov 8, 2023Updated 2 years ago
- β© Generating speech in a single forward pass without any attention!β580Mar 15, 2026Updated last month
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,992Jul 5, 2024Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ707Jul 12, 2022Updated 3 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β649Oct 3, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The Implementation of FastSpeech based on pytorch.β880Jul 6, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β867Jul 22, 2023Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β845Oct 10, 2023Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,165Oct 27, 2023Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,639Apr 22, 2024Updated last year
- WaveRNN Vocoder + TTSβ2,180Jul 2, 2022Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,340Jul 27, 2024Updated last year
- β261Dec 8, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- VCTK multi-speaker tacotron for ICASSP 2020β266Mar 29, 2022Updated 4 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β897Jul 6, 2023Updated 2 years ago
- List of speech synthesis papers.β1,071Jul 24, 2023Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Networkβ321Jul 25, 2024Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β329Sep 24, 2022Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.β184Aug 12, 2020Updated 5 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β359Mar 25, 2023Updated 3 years ago
- πΈ collection of TTS papersβ726Jul 4, 2024Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ234Jun 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Unsupervised Speech Decomposition Via Triple Information Bottleneckβ699Oct 23, 2024Updated last year
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Datasetβ362Dec 24, 2021Updated 4 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- End-to-End Speech Processing Toolkitβ9,805Updated this week
- Official code for Cotatron @ INTERSPEECH 2020β214Jul 25, 2024Updated last year
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generationβ197Feb 10, 2022Updated 4 years ago
- Simple text to phones converter for multiple languagesβ1,531Sep 26, 2024Updated last year
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.β409Jul 7, 2021Updated 4 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!β360Apr 27, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.β144Jul 8, 2021Updated 4 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.β601Sep 18, 2023Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ917Jan 5, 2023Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ1,038Aug 28, 2023Updated 2 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"β116Dec 22, 2021Updated 4 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeechβ252Feb 9, 2022Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.β157Jul 2, 2021Updated 4 years ago