Awesome list of TTS papers with audio samples
β61Aug 18, 2020Updated 5 years ago
Alternatives and similar repositories for awesome-tts-samples
Users that are interested in awesome-tts-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of speech synthesis papers.β1,073Jul 24, 2023Updated 2 years ago
- πΈ collection of TTS papersβ727Jul 4, 2024Updated last year
- Official code for Cotatron @ INTERSPEECH 2020β214Jul 25, 2024Updated last year
- bumble bee transformerβ14Apr 19, 2021Updated 5 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- RepVgg + HiFiGANβ36Aug 10, 2022Updated 3 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speechβ11Aug 12, 2020Updated 5 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generationβ80Feb 24, 2021Updated 5 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdfβ370Nov 5, 2021Updated 4 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ711Jul 12, 2022Updated 3 years ago
- Real-Time High-Fidelity Speech Synthesis without GPUβ73Jul 29, 2024Updated last year
- β16Apr 4, 2022Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversionβ338Jul 6, 2023Updated 2 years ago
- β14Aug 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognitionβ17Jul 25, 2024Updated last year
- Authors' implementation of DeepSpeech Distances.β130May 5, 2020Updated 6 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorchβ88Jul 25, 2024Updated last year
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.β74Aug 3, 2021Updated 4 years ago
- β15May 8, 2021Updated 5 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"β116Dec 22, 2021Updated 4 years ago
- Pytorch Implementation of WaveNODEβ64Sep 4, 2020Updated 5 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotationsβ141Apr 27, 2024Updated 2 years ago
- This is a pytorch implementation of StarGAN-VC2.β13Dec 17, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.β157Jul 2, 2021Updated 4 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATSβ64Nov 18, 2024Updated last year
- β69Mar 31, 2021Updated 5 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIANβ¦β74Sep 21, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ201Sep 4, 2022Updated 3 years ago
- β179Nov 10, 2021Updated 4 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)β71Nov 10, 2023Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATSβ130Jun 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesisβ88Feb 23, 2021Updated 5 years ago
- β64May 23, 2022Updated 3 years ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)β285Oct 8, 2021Updated 4 years ago
- Official implementation of the source-filter HiFiGAN vocoderβ272Jul 29, 2023Updated 2 years ago
- Official implementation of BVAE-TTSβ173Sep 26, 2022Updated 3 years ago
- Awesome TTSβ64Sep 16, 2021Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ84May 23, 2023Updated 2 years ago