BridgetteSong/ExpressiveTacotron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BridgetteSong/ExpressiveTacotron)

BridgetteSong / ExpressiveTacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

☆74

Alternatives and similar repositories for ExpressiveTacotron

Users that are interested in ExpressiveTacotron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BridgetteSong / Tacotron2
View on GitHub
☆13Sep 21, 2022Updated 3 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
KevinMIN95 / StyleSpeech
View on GitHub
Official implementation of Meta-StyleSpeech and StyleSpeech
☆253Feb 9, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
keonlee9420 / VAENAR-TTS
View on GitHub
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
☆74Aug 3, 2021Updated 4 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
keonlee9420 / STYLER
View on GitHub
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…
☆159Jun 5, 2025Updated last year
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
JoungheeKim / Non-Attentive-Tacotron
View on GitHub
This is Pytorch Implementation of Google's Non-attentive Tacotron.
☆57Dec 21, 2022Updated 3 years ago
keonlee9420 / Comprehensive-E2E-TTS
View on GitHub
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147Jun 6, 2022Updated 4 years ago
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
nii-yamagishilab / Extended_VQVAE
View on GitHub
☆64Aug 14, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
keonlee9420 / StyleSpeech
View on GitHub
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
☆197Feb 10, 2022Updated 4 years ago
keonlee9420 / Parallel-Tacotron2
View on GitHub
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Nov 18, 2021Updated 4 years ago
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
hs-oh-prml / DiffProsody
View on GitHub
☆69Jul 29, 2023Updated 2 years ago
rishikksh20 / UnivNet-pytorch
View on GitHub
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
☆76Aug 30, 2021Updated 4 years ago
ncsoft / avocodo
View on GitHub
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
☆154Feb 1, 2023Updated 3 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rishikksh20 / Fre-GAN-pytorch
View on GitHub
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
☆112Aug 26, 2021Updated 4 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
neosapience / editts
View on GitHub
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆122Jan 24, 2023Updated 3 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
NVIDIA / radtts
View on GitHub
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …
☆291Apr 6, 2023Updated 3 years ago
b04901014 / MQTTS
View on GitHub
☆260May 15, 2023Updated 3 years ago
LEEYOONHYUNG / BVAE-TTS
View on GitHub
Official implementation of BVAE-TTS
☆173Sep 26, 2022Updated 3 years ago
lifeiteng / TTS-TextAnalyzer
View on GitHub
TTS Text Analyzer
☆31Jul 20, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
freds0 / katube
View on GitHub
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…
☆26Jul 27, 2024Updated last year
gmltmd789 / UnitSpeech
View on GitHub
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
☆137Aug 17, 2023Updated 2 years ago
liusongxiang / efficient_tts
View on GitHub
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116Dec 22, 2021Updated 4 years ago
adelacvg / diff-vits
View on GitHub
☆39Oct 1, 2023Updated 2 years ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago