Awesome list of TTS papers with audio samples
β61Aug 18, 2020Updated 5 years ago
Alternatives and similar repositories for awesome-tts-samples
Users that are interested in awesome-tts-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of speech synthesis papers.β1,069Jul 24, 2023Updated 2 years ago
- πΈ collection of TTS papersβ723Jul 4, 2024Updated last year
- Official code for Cotatron @ INTERSPEECH 2020β214Jul 25, 2024Updated last year
- bumble bee transformerβ14Apr 19, 2021Updated 4 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- RepVgg + HiFiGANβ36Aug 10, 2022Updated 3 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speechβ11Aug 12, 2020Updated 5 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generationβ80Feb 24, 2021Updated 5 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ706Jul 12, 2022Updated 3 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdfβ372Nov 5, 2021Updated 4 years ago
- Real-Time High-Fidelity Speech Synthesis without GPUβ73Jul 29, 2024Updated last year
- β16Apr 4, 2022Updated 3 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversionβ339Jul 6, 2023Updated 2 years ago
- β14Aug 19, 2024Updated last year
- NordVPN Threat Protection Proβ’ β’ AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognitionβ17Jul 25, 2024Updated last year
- Authors' implementation of DeepSpeech Distances.β130May 5, 2020Updated 5 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorchβ88Jul 25, 2024Updated last year
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.β73Aug 3, 2021Updated 4 years ago
- β15May 8, 2021Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"β116Dec 22, 2021Updated 4 years ago
- Pytorch Implementation of WaveNODEβ64Sep 4, 2020Updated 5 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotationsβ142Apr 27, 2024Updated last year
- This is a pytorch implementation of StarGAN-VC2.β13Dec 17, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.β157Jul 2, 2021Updated 4 years ago
- β69Mar 31, 2021Updated 4 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATSβ64Nov 18, 2024Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIANβ¦β74Sep 21, 2022Updated 3 years ago
- β179Nov 10, 2021Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ203Sep 4, 2022Updated 3 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)β71Nov 10, 2023Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATSβ130Jun 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesisβ88Feb 23, 2021Updated 5 years ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)β283Oct 8, 2021Updated 4 years ago
- β64May 23, 2022Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoderβ270Jul 29, 2023Updated 2 years ago
- Official implementation of BVAE-TTSβ173Sep 26, 2022Updated 3 years ago
- Awesome TTSβ64Sep 16, 2021Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ84May 23, 2023Updated 2 years ago