NVIDIA/tacotron2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/tacotron2)

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

☆5,299

Alternatives and similar repositories for tacotron2

Users that are interested in tacotron2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVIDIA / waveglow
View on GitHub
A Flow-based Generative Network for Speech Synthesis
☆2,339Oct 19, 2023Updated 2 years ago
Rayhane-mamah / Tacotron-2
View on GitHub
DeepMind's Tacotron-2 Tensorflow implementation
☆2,324Jul 6, 2023Updated 3 years ago
keithito / tacotron
View on GitHub
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
☆2,996Jul 6, 2023Updated 3 years ago
r9y9 / wavenet_vocoder
View on GitHub
WaveNet vocoder
☆2,375Jul 29, 2023Updated 3 years ago
jik876 / hifi-gan
View on GitHub
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,363Jul 27, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fatchord / WaveRNN
View on GitHub
WaveRNN Vocoder + TTS
☆2,191Jul 2, 2022Updated 4 years ago
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆869Jul 22, 2023Updated 3 years ago
NVIDIA / nv-wavenet
View on GitHub
Reference implementation of real-time autoregressive wavenet inference
☆745Jan 19, 2021Updated 5 years ago
xcmyz / FastSpeech
View on GitHub
The Implementation of FastSpeech based on pytorch.
☆885Jul 6, 2023Updated 3 years ago
r9y9 / deepvoice3_pytorch
View on GitHub
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
☆1,976Dec 19, 2023Updated 2 years ago
ming024 / FastSpeech2
View on GitHub
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆2,185Oct 27, 2023Updated 2 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,904Updated this week
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago
seungwonpark / melgan
View on GitHub
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆650Oct 3, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mozilla / TTS
View on GitHub
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10,164Nov 9, 2023Updated 2 years ago
jaywalnut310 / glow-tts
View on GitHub
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆712Jul 12, 2022Updated 4 years ago
NVIDIA / flowtron
View on GitHub
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…
☆895Jul 6, 2023Updated 3 years ago
descriptinc / melgan-neurips
View on GitHub
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆1,040Aug 28, 2023Updated 2 years ago
soobinseo / Transformer-TTS
View on GitHub
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
☆692Nov 8, 2023Updated 2 years ago
xiph / LPCNet
View on GitHub
Efficient neural speech synthesis
☆1,219Sep 21, 2024Updated last year
TensorSpeech / TensorFlowTTS
View on GitHub
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…
☆3,993Jul 5, 2024Updated 2 years ago
r9y9 / tacotron_pytorch
View on GitHub
PyTorch implementation of Tacotron speech synthesis model.
☆310Jul 12, 2019Updated 7 years ago
Kyubyong / tacotron
View on GitHub
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
☆1,833Jan 17, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
syang1993 / gst-tacotron
View on GitHub
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
☆367Dec 6, 2018Updated 7 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
KinglittleQ / GST-Tacotron
View on GitHub
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆374Dec 8, 2022Updated 3 years ago
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,890Dec 6, 2023Updated 2 years ago
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 3 years ago
Kyubyong / g2p
View on GitHub
g2p: English Grapheme To Phoneme Conversion
☆927Jan 5, 2023Updated 3 years ago
CorentinJ / Real-Time-Voice-Cloning
View on GitHub
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,066Mar 9, 2026Updated 4 months ago
CSTR-Edinburgh / merlin
View on GitHub
This is now the official location of the Merlin project.
☆1,320Mar 3, 2020Updated 6 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,857Jul 11, 2026Updated 2 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bootphon / phonemizer
View on GitHub
Simple text to phones converter for multiple languages
☆1,559Sep 26, 2024Updated last year
Tomiinek / Multilingual_Text_to_Speech
View on GitHub
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆844Oct 10, 2023Updated 2 years ago
resemble-ai / Resemblyzer
View on GitHub
A python package to analyze and compare voices with deep learning
☆3,292Oct 12, 2023Updated 2 years ago
auspicious3000 / autovc
View on GitHub
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,100Oct 23, 2024Updated last year
ksw0306 / ClariNet
View on GitHub
A Pytorch Implementation of ClariNet
☆293Aug 5, 2019Updated 6 years ago
speechbrain / speechbrain
View on GitHub
A PyTorch-based Speech Toolkit
☆11,726Jun 15, 2026Updated last month
Kyubyong / css10
View on GitHub
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
☆490Mar 6, 2020Updated 6 years ago