An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
☆54Sep 14, 2022Updated 3 years ago
Alternatives and similar repositories for Glow_TTS
Users that are interested in Glow_TTS are comparing it to the libraries listed below
Sorting:
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆703Jul 12, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Gaussian Mixture VAE Tacotron☆53Jul 6, 2023Updated 2 years ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆69Jul 29, 2023Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- Implementation of Multi speaker TTS☆51Jan 2, 2021Updated 5 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Jun 5, 2025Updated 9 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/☆28Feb 12, 2021Updated 5 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Jul 6, 2023Updated 2 years ago
- ☆17Aug 27, 2025Updated 6 months ago
- ☆19Mar 22, 2024Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Oct 1, 2019Updated 6 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆113Jun 6, 2022Updated 3 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆94Sep 1, 2021Updated 4 years ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated 10 months ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago