google/tacotron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google/tacotron)

google / tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

☆539

Alternatives and similar repositories for tacotron

Users that are interested in tacotron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

keithito / tacotron
View on GitHub
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
☆2,995Jul 6, 2023Updated 3 years ago
Kyubyong / tacotron
View on GitHub
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
☆1,833Jan 17, 2022Updated 4 years ago
sotelo / parrot
View on GitHub
RNN-based generative models for speech.
☆607Jun 23, 2017Updated 9 years ago
Rayhane-mamah / Tacotron-2
View on GitHub
DeepMind's Tacotron-2 Tensorflow implementation
☆2,323Jul 6, 2023Updated 3 years ago
r9y9 / MelGeneralizedCepstrums.jl
View on GitHub
Mel-Generalized Cepstrum analysis
☆19Jul 21, 2017Updated 9 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Kyubyong / expressive_tacotron
View on GitHub
Tensorflow Implementation of Expressive Tacotron
☆194Nov 3, 2018Updated 7 years ago
NVIDIA / tacotron2
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆5,297Jun 12, 2024Updated 2 years ago
barronalex / Tacotron
View on GitHub
Implementation of Google's Tacotron in TensorFlow
☆237May 11, 2018Updated 8 years ago
lifeiteng / Rabbit
View on GitHub
Explore Text-To-Speech
☆25Jun 22, 2018Updated 8 years ago
candlewill / Ossian
View on GitHub
Ossian: A simple language-independent Text-to-speech frontend
☆17Mar 1, 2018Updated 8 years ago
Kyubyong / deepvoice3
View on GitHub
Tensorflow Implementation of Deep Voice 3
☆448Mar 14, 2018Updated 8 years ago
CSTR-Edinburgh / merlin
View on GitHub
This is now the official location of the Merlin project.
☆1,320Mar 3, 2020Updated 6 years ago
JeremyCCHsu / Python-Wrapper-for-World-Vocoder
View on GitHub
A Python wrapper for the high-quality vocoder "World"
☆790Jan 21, 2025Updated last year
jinhan / tacotron2-vae
View on GitHub
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆169Jul 6, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
nii-yamagishilab / tacotron2
View on GitHub
An implementation of Tacotron and Tacotron2
☆80Aug 4, 2021Updated 4 years ago
NVIDIA / nv-wavenet
View on GitHub
Reference implementation of real-time autoregressive wavenet inference
☆745Jan 19, 2021Updated 5 years ago
r9y9 / gantts
View on GitHub
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
☆518Nov 1, 2020Updated 5 years ago
r9y9 / tacotron_pytorch
View on GitHub
PyTorch implementation of Tacotron speech synthesis model.
☆310Jul 12, 2019Updated 7 years ago
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆869Jul 22, 2023Updated 3 years ago
xiph / LPCNet
View on GitHub
Efficient neural speech synthesis
☆1,219Sep 21, 2024Updated last year
kan-bayashi / Taco2withBERT
View on GitHub
Tacotron2 with BERT examples
☆10Jul 8, 2019Updated 7 years ago
KinglittleQ / GST-Tacotron
View on GitHub
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆374Dec 8, 2022Updated 3 years ago
geneing / WaveRNN-Pytorch
View on GitHub
Fatcord's Alternative WaveRNN (Faster training)
☆132Nov 29, 2020Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Kyubyong / tacotron_asr
View on GitHub
Speech Recognition Using Tacotron
☆164Sep 20, 2017Updated 8 years ago
xcmyz / FastSpeech
View on GitHub
The Implementation of FastSpeech based on pytorch.
☆885Jul 6, 2023Updated 3 years ago
soobinseo / Transformer-TTS
View on GitHub
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
☆691Nov 8, 2023Updated 2 years ago
mmorise / World
View on GitHub
A high-quality speech analysis, manipulation and synthesis system
☆1,332Feb 18, 2026Updated 5 months ago
syang1993 / gst-tacotron
View on GitHub
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
☆367Dec 6, 2018Updated 7 years ago
soroushmehr / sampleRNN_ICLR2017
View on GitHub
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
☆543Nov 12, 2021Updated 4 years ago
ibab / tensorflow-wavenet
View on GitHub
A TensorFlow implementation of DeepMind's WaveNet paper
☆5,429Jul 12, 2023Updated 3 years ago
facebookarchive / loop
View on GitHub
A method to generate speech across multiple speakers
☆874Mar 21, 2019Updated 7 years ago
ksw0306 / ClariNet
View on GitHub
A Pytorch Implementation of ClariNet
☆293Aug 5, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ksw0306 / FloWaveNet
View on GitHub
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
☆490Apr 23, 2019Updated 7 years ago
fatchord / WaveRNN
View on GitHub
WaveRNN Vocoder + TTS
☆2,188Jul 2, 2022Updated 4 years ago
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
tomlepaine / fast-wavenet
View on GitHub
Speedy Wavenet generation using dynamic programming
☆1,772Jun 20, 2017Updated 9 years ago
jjery2243542 / adaptive_voice_conversion
View on GitHub
☆484Oct 29, 2020Updated 5 years ago
jxzhanggg / nonparaSeq2seqVC_code
View on GitHub
Implementation code of non-parallel sequence-to-sequence VC
☆248Mar 24, 2023Updated 3 years ago
thuhcsi / Crystal.TTVS
View on GitHub
Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.
☆88Aug 17, 2020Updated 5 years ago