seungwonpark/awesome-tts-samples

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/seungwonpark/awesome-tts-samples)

seungwonpark / awesome-tts-samples

Awesome list of TTS papers with audio samples

☆61

Alternatives and similar repositories for awesome-tts-samples

Users that are interested in awesome-tts-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 2 years ago
coqui-ai / TTS-papers
View on GitHub
🐸 collection of TTS papers
☆731Jul 4, 2024Updated 2 years ago
maum-ai / cotatron
View on GitHub
Official code for Cotatron @ INTERSPEECH 2020
☆213Jul 25, 2024Updated last year
quinte22 / bumblebee
View on GitHub
bumble bee transformer
☆14Apr 19, 2021Updated 5 years ago
yanggeng1995 / EATS
View on GitHub
A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech
☆127Jul 16, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
azraelkuan / repgan
View on GitHub
RepVgg + HiFiGAN
☆36Aug 10, 2022Updated 3 years ago
dathudeptrai / FastSpeech2
View on GitHub
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11Aug 12, 2020Updated 5 years ago
naver-ai / RapFlow-TTS
View on GitHub
☆56Jul 16, 2025Updated last year
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
tts-tutorial / survey
View on GitHub
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
☆371Nov 5, 2021Updated 4 years ago
jaywalnut310 / glow-tts
View on GitHub
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆712Jul 12, 2022Updated 4 years ago
BogiHsu / WG-WaveNet
View on GitHub
Real-Time High-Fidelity Speech Synthesis without GPU
☆73Jul 29, 2024Updated last year
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
bshall / ZeroSpeech
View on GitHub
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
☆339Jul 6, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
eatsleepraverepeat / reMUDE
View on GitHub
(re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition
☆17Jul 25, 2024Updated last year
mbinkowski / DeepSpeechDistances
View on GitHub
Authors' implementation of DeepSpeech Distances.
☆130May 5, 2020Updated 6 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
AppleHolic / pytorch_sound
View on GitHub
Sound Related Deep Learning Tasks boosting repository with pytorch
☆88Jul 25, 2024Updated last year
keonlee9420 / VAENAR-TTS
View on GitHub
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
☆74Aug 3, 2021Updated 4 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
liusongxiang / efficient_tts
View on GitHub
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116Dec 22, 2021Updated 4 years ago
ANLGBOY / WaveNODE
View on GitHub
Pytorch Implementation of WaveNODE
☆64Sep 4, 2020Updated 5 years ago
X-LANCE / StoryTTS
View on GitHub
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆141Apr 27, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xcmyz / FastVocoder
View on GitHub
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
☆157Jul 2, 2021Updated 5 years ago
ariacat3366 / pytorch-StarGAN-VC2-implementation
View on GitHub
This is a pytorch implementation of StarGAN-VC2.
☆13Dec 17, 2019Updated 6 years ago
X-LANCE / UniCATS-CTX-txt2vec
View on GitHub
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64Nov 18, 2024Updated last year
BridgetteSong / ExpressiveTacotron
View on GitHub
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74Sep 21, 2022Updated 3 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
keonlee9420 / Parallel-Tacotron2
View on GitHub
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Nov 18, 2021Updated 4 years ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
X-LANCE / UniCATS-CTX-vec2wav
View on GitHub
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130Jun 11, 2024Updated 2 years ago
rishikksh20 / TFGAN
View on GitHub
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
☆88Feb 23, 2021Updated 5 years ago
dqqcasia / awesome-speech-translation
View on GitHub
☆178Nov 10, 2021Updated 4 years ago
maum-ai / univnet
View on GitHub
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
☆286Oct 8, 2021Updated 4 years ago
tts-tutorial / icassp2022
View on GitHub
☆64May 23, 2022Updated 4 years ago
LEEYOONHYUNG / BVAE-TTS
View on GitHub
Official implementation of BVAE-TTS
☆173Sep 26, 2022Updated 3 years ago
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 4 years ago