Awesome list of TTS papers with audio samples
β61Aug 18, 2020Updated 5 years ago
Alternatives and similar repositories for awesome-tts-samples
Users that are interested in awesome-tts-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of speech synthesis papers.β1,071Jul 24, 2023Updated 2 years ago
- πΈ collection of TTS papersβ727Jul 4, 2024Updated last year
- Official code for Cotatron @ INTERSPEECH 2020β214Jul 25, 2024Updated last year
- bumble bee transformerβ14Apr 19, 2021Updated 5 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- RepVgg + HiFiGANβ36Aug 10, 2022Updated 3 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speechβ11Aug 12, 2020Updated 5 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generationβ80Feb 24, 2021Updated 5 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ708Jul 12, 2022Updated 3 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdfβ370Nov 5, 2021Updated 4 years ago
- Real-Time High-Fidelity Speech Synthesis without GPUβ73Jul 29, 2024Updated last year
- β16Apr 4, 2022Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversionβ339Jul 6, 2023Updated 2 years ago
- β14Aug 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognitionβ17Jul 25, 2024Updated last year
- Authors' implementation of DeepSpeech Distances.β130May 5, 2020Updated 5 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorchβ88Jul 25, 2024Updated last year
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.β74Aug 3, 2021Updated 4 years ago
- β15May 8, 2021Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"β116Dec 22, 2021Updated 4 years ago
- Pytorch Implementation of WaveNODEβ64Sep 4, 2020Updated 5 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotationsβ141Apr 27, 2024Updated 2 years ago
- This is a pytorch implementation of StarGAN-VC2.β13Dec 17, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.β157Jul 2, 2021Updated 4 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATSβ64Nov 18, 2024Updated last year
- β69Mar 31, 2021Updated 5 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIANβ¦β74Sep 21, 2022Updated 3 years ago
- β179Nov 10, 2021Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ201Sep 4, 2022Updated 3 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)β71Nov 10, 2023Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATSβ130Jun 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesisβ88Feb 23, 2021Updated 5 years ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)β284Oct 8, 2021Updated 4 years ago
- β64May 23, 2022Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoderβ272Jul 29, 2023Updated 2 years ago
- Official implementation of BVAE-TTSβ173Sep 26, 2022Updated 3 years ago
- Awesome TTSβ64Sep 16, 2021Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ84May 23, 2023Updated 2 years ago