thuhcsi/tacotron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thuhcsi/tacotron)

thuhcsi / tacotron

PyTorch implementation of Tacotron and Tacotron2

☆34

Alternatives and similar repositories for tacotron

Users that are interested in tacotron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
awesome-archive / tacotron_cn
View on GitHub
chinese_tacotron-2
☆12Feb 27, 2018Updated 8 years ago
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bshall / Tacotron
View on GitHub
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
☆115Dec 2, 2020Updated 5 years ago
YichaoL / Chinese_Polyphone_Disambiguation
View on GitHub
论文复现，使用pos标记进行中文多音字消歧
☆21Jul 20, 2019Updated 7 years ago
keonlee9420 / Stepwise_Monotonic_Multihead_Attention
View on GitHub
PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …
☆39May 16, 2021Updated 5 years ago
r9y9 / tacotron_pytorch
View on GitHub
PyTorch implementation of Tacotron speech synthesis model.
☆310Jul 12, 2019Updated 7 years ago
stefantaubert / mel-cepstral-distance
View on GitHub
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆67Aug 24, 2025Updated 11 months ago
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
thuhcsi / SpanPSP
View on GitHub
☆76Apr 26, 2022Updated 4 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
azraelkuan / repgan
View on GitHub
RepVgg + HiFiGAN
☆36Aug 10, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
Yablon / auorange
View on GitHub
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
☆62Jun 8, 2021Updated 5 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
papercup-open-source / phonological-features
View on GitHub
Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"
☆36Aug 11, 2020Updated 5 years ago
KinglittleQ / GST-Tacotron
View on GitHub
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆374Dec 8, 2022Updated 3 years ago
thuhcsi / DiffVar
View on GitHub
☆30Aug 12, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
pengzhendong / g2p-mix
View on GitHub
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆115Updated this week
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
bfs18 / e2_tts
View on GitHub
☆69Sep 3, 2024Updated last year
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
MingjieChen / EasyVC
View on GitHub
A toolkit for any-to-any encoder-decoder voice conversion systems
☆83Aug 10, 2023Updated 2 years ago
keonlee9420 / Comprehensive-Transformer-TTS
View on GitHub
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…
☆328Sep 24, 2022Updated 3 years ago
thuhcsi / mm2022-conversational-tts
View on GitHub
☆11May 9, 2023Updated 3 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
bfs18 / rfwave
View on GitHub
☆152Apr 25, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
shackysureshot / Mel-Cepstral-Distortion
View on GitHub
Calculation of MCD (dB) between two speech waveforms
☆57Sep 26, 2020Updated 5 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
Daisyqk / Automatic-Prosody-Annotation
View on GitHub
☆112Mar 9, 2026Updated 4 months ago
roholazandie / ryan-tts
View on GitHub
☆18Jan 17, 2022Updated 4 years ago
karamarieliu / gst_tacotron2_wavenet
View on GitHub
☆13Aug 11, 2018Updated 7 years ago
roedoejet / FastSpeech2_ACL2022_reproducibility
View on GitHub
☆21Feb 27, 2024Updated 2 years ago
keonlee9420 / Deep-Learning-TTS-Template
View on GitHub
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
☆14Jun 15, 2021Updated 5 years ago