Edresson / Coqui-TTS
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
β34Updated 3 years ago
Alternatives and similar repositories for Coqui-TTS
Users that are interested in Coqui-TTS are comparing it to the libraries listed below
Sorting:
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",β¦β79Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2β51Updated last year
- How to use our public wav2vec2 age and gender modelβ40Updated last year
- β71Updated last year
- β20Updated 2 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)β24Updated last year
- β31Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"β132Updated last year
- Monotonic Alignment Searchβ91Updated 2 years ago
- β69Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTSβ23Updated 3 years ago
- β29Updated last year
- β13Updated 2 years ago
- The Official Implementation of βContent-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthβ¦β85Updated 2 years ago
- β33Updated last year
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptionsβ76Updated 7 months ago
- Adaptive Vocoder for Custom Voiceβ59Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers.β48Updated 3 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Dataβ70Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversionβ39Updated 2 years ago
- β29Updated 3 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using Γ-VAE"β42Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesisβ56Updated 3 years ago
- Interface for Controllable Expressive Talking Machineβ38Updated last year
- All generative model in one for better TTS modelβ71Updated 8 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.β36Updated 2 years ago
- BigVGAN with Neural Source-Filterβ55Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIANβ¦β74Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Trainingβ123Updated 2 years ago