JarodMica / tortoise_dataset_toolsLinks
Misc. tools/scripts that I made to use for tortoise
☆21Updated last year
Alternatives and similar repositories for tortoise_dataset_tools
Users that are interested in tortoise_dataset_tools are comparing it to the libraries listed below
Sorting:
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆23Updated last year
- Your one-stop solution for voice dataset creation☆127Updated last year
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆33Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆47Updated 2 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆83Updated 2 years ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆158Updated 2 years ago
- Official Implementation of StyleTTS-VC☆191Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆80Updated last year
- An unofficial PyTorch implementation of VALL-E☆88Updated 2 months ago
- StyleTTS 2 Optimized Training Fork☆34Updated 8 months ago
- Zero-Shot Emotion Style Transfer☆49Updated 6 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- List of repositories relevant to VITS.☆35Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 8 months ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- VALL-E 2 reproduction☆131Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Updated 3 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆85Updated 11 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated 9 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 2 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆130Updated 11 months ago
- VoiceBox neural network implementation☆110Updated last year
- My vocoder experiments☆31Updated 3 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆186Updated last year