Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
☆539Aug 1, 2025Updated 7 months ago
Alternatives and similar repositories for tacotron
Users that are interested in tacotron are comparing it to the libraries listed below
Sorting:
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,835Jan 17, 2022Updated 4 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,989Jul 6, 2023Updated 2 years ago
- RNN-based generative models for speech.☆610Jun 23, 2017Updated 8 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆2,318Jul 6, 2023Updated 2 years ago
- Implementation of Google's Tacotron in TensorFlow☆236May 11, 2018Updated 7 years ago
- An implementation of Tacotron and Tacotron2☆80Aug 4, 2021Updated 4 years ago
- Explore Text-To-Speech☆25Jun 22, 2018Updated 7 years ago
- PyTorch implementation of Tacotron speech synthesis model.☆311Jul 12, 2019Updated 6 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆5,303Jun 12, 2024Updated last year
- A Python wrapper for the high-quality vocoder "World"☆778Jan 21, 2025Updated last year
- Reference implementation of real-time autoregressive wavenet inference☆746Jan 19, 2021Updated 5 years ago
- Tensorflow Implementation of Deep Voice 3☆449Mar 14, 2018Updated 7 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- This is now the official location of the Merlin project.☆1,322Mar 3, 2020Updated 6 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆518Nov 1, 2020Updated 5 years ago
- A high-quality speech analysis, manipulation and synthesis system☆1,299Feb 18, 2026Updated last week
- Speech Recognition Using Tacotron☆164Sep 20, 2017Updated 8 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- The Implementation of FastSpeech based on pytorch.☆880Jul 6, 2023Updated 2 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"☆690Nov 8, 2023Updated 2 years ago
- A method to generate speech across multiple speakers☆875Mar 21, 2019Updated 6 years ago
- A TensorFlow implementation of DeepMind's WaveNet paper☆5,437Jul 12, 2023Updated 2 years ago
- WaveRNN Vocoder + TTS☆2,177Jul 2, 2022Updated 3 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆866Jul 22, 2023Updated 2 years ago
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆544Nov 12, 2021Updated 4 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Aug 17, 2020Updated 5 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆374Dec 8, 2022Updated 3 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Dec 6, 2018Updated 7 years ago
- Ossian: A simple language-independent Text-to-speech frontend☆17Mar 1, 2018Updated 8 years ago
- Efficient neural speech synthesis☆1,203Sep 21, 2024Updated last year
- Speedy Wavenet generation using dynamic programming☆1,772Jun 20, 2017Updated 8 years ago
- A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"☆490Apr 23, 2019Updated 6 years ago
- ☆484Oct 29, 2020Updated 5 years ago
- ☆24Dec 22, 2016Updated 9 years ago
- Deep Voice: Real-time Neural Text-to-Speech☆364Mar 21, 2017Updated 8 years ago
- A Pytorch Implementation of ClariNet☆292Aug 5, 2019Updated 6 years ago