rebotnix / Tortoise-TTS-TrainingLinks
Community framework for training tortoise
☆43Updated 2 years ago
Alternatives and similar repositories for Tortoise-TTS-Training
Users that are interested in Tortoise-TTS-Training are comparing it to the libraries listed below
Sorting:
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- TorToiSe fine-tuning with DLAS☆224Updated last year
- Your one-stop solution for voice dataset creation☆123Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆46Updated 2 years ago
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆129Updated 9 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated last year
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆249Updated 2 years ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆236Updated last year
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆289Updated 2 years ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆155Updated last year
- List of repositories relevant to VITS.☆36Updated 2 years ago
- Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach☆69Updated 2 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆136Updated 10 months ago
- One Shot Voice Cloning base on Unet-TTS☆242Updated 3 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆250Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆180Updated 2 years ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆92Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆148Updated 2 years ago
- Collect Voice Conversion researches☆93Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 10 months ago
- DLAS - A configuration-driven trainer for generative models☆139Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Updated 2 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆85Updated 9 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆122Updated 3 years ago