Edresson / Coqui-TTSLinks
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆35Updated 3 years ago
Alternatives and similar repositories for Coqui-TTS
Users that are interested in Coqui-TTS are comparing it to the libraries listed below
Sorting:
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80Updated 2 years ago
 - The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆86Updated 2 years ago
 - HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84Updated 2 years ago
 - Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Updated 3 years ago
 - Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
 - Monotonic Alignment Search☆96Updated 4 months ago
 - JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆111Updated 3 years ago
 - TransferTTS (Zero-Shot learning of VITS)☆101Updated 3 years ago
 - ☆71Updated 2 years ago
 - TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆147Updated last year
 - [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated 2 years ago
 - ☆69Updated 2 years ago
 - This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆133Updated last year
 - ☆111Updated 3 years ago
 - AdaSpeech: Adaptive Text to Speech for Custom Voice☆162Updated 4 years ago
 - Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach☆70Updated 3 years ago
 - VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆35Updated 3 years ago
 - Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆142Updated 3 years ago
 - Official Implementation of StyleTTS-VC☆191Updated 9 months ago
 - This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 3 years ago
 - Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Updated 3 years ago
 - PyTorch Implementation of Multi-Singer (ACM-MM'21)☆138Updated 3 years ago
 - An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Updated 4 years ago
 - SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆202Updated 3 years ago
 - S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
 - NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆150Updated 2 years ago
 - BigVGAN with Neural Source-Filter☆55Updated 2 years ago
 - Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Updated 2 years ago
 - The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆126Updated 4 years ago
 - Collect Voice Conversion researches☆94Updated last week