Edresson / Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆33 Updated 3 years ago
Alternatives and similar repositories for Coqui-TTS:
Users that are interested in Coqui-TTS are comparing it to the libraries listed below
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… ☆79 Updated last year
- Monotonic Alignment Search ☆91 Updated 2 years ago
- Singing Voice Speech modeling test ☆35 Updated 2 years ago
- ☆71 Updated last year
- Adaptive Vocoder for Custom Voice ☆59 Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 Updated 3 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 Updated last year
- ☆56 Updated 2 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆36 Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆85 Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning. ☆29 Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech ☆50 Updated 2 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis. ☆72 Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆57 Updated last year
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021 ☆39 Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆123 Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆44 Updated last year
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis" ☆28 Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆120 Updated 2 years ago
- ☆29 Updated 3 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆23 Updated 3 years ago
- ☆79 Updated 11 months ago
- ☆20 Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech ☆46 Updated 2 years ago
- How to use our public wav2vec2 age and gender model ☆39 Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆56 Updated 3 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024) ☆24 Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆70 Updated 2 years ago