rebotnix / Tortoise-TTS-Training
Community framework for training Tortoise TTS
☆41 Updated 2 years ago
Alternatives and similar repositories for Tortoise-TTS-Training:
Users who are interested in Tortoise-TTS-Training are comparing it to the libraries listed below.
- Your one-stop solution for voice dataset creation ☆118 Updated last year
- Barkify: an unofficial training implementation of Bark TTS by suno-ai ☆128 Updated last year
- Create training data for training a voice cloner for Bark text to speech. ☆44 Updated last year
- TorToiSe fine-tuning with DLAS ☆218 Updated 8 months ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆122 Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆126 Updated 4 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io ☆68 Updated last year
- [WIP] VoiceSmith makes training text to speech models easy. ☆224 Updated 2 years ago
- Official Implementation of StyleTTS-VC ☆177 Updated 2 months ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech ☆234 Updated last year
- ☆71 Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis ☆148 Updated 2 years ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei ☆144 Updated last year
- Finetuning VITS Efficiently ☆32 Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆120 Updated 2 years ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions ☆92 Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆137 Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E ☆87 Updated this week
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆132 Updated last year
- Train the next generation of TTS systems. ☆165 Updated 6 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆145 Updated last year
- SelfRemaster: SSL Speech Restoration ☆88 Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆88 Updated 3 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion ☆238 Updated last year
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆160 Updated 3 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… ☆79 Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆127 Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ☆102 Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆243 Updated 2 months ago