Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
☆128Apr 9, 2021Updated 4 years ago
Alternatives and similar repositories for tacotron2
Users that are interested in tacotron2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 6 months ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 3 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- ☆484Oct 29, 2020Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆458Jun 26, 2024Updated last year
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆374Dec 8, 2022Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- Collect Voice Conversion researches☆96Mar 17, 2026Updated last week
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Feb 20, 2022Updated 4 years ago
- ☆13Sep 21, 2022Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Mar 25, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆866Jul 22, 2023Updated 2 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 6 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆203Nov 30, 2020Updated 5 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆283Oct 10, 2023Updated 2 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- ☆21Jun 16, 2021Updated 4 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- style token with tacotron2☆62Jul 6, 2023Updated 2 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Dec 6, 2018Updated 7 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆233Dec 27, 2019Updated 6 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆88Feb 23, 2021Updated 5 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago