IIEleven11 / Automatic-Audio-Dataset-Maker
Automatically cleans, enhances, segments, filters, and formats a dataset to fine-tune or train a voice model.
☆42 Updated last month
Alternatives and similar repositories for Automatic-Audio-Dataset-Maker
Users who are interested in Automatic-Audio-Dataset-Maker are comparing it to the libraries listed below.
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆100 Updated 2 weeks ago
- StyleTTS 2 Optimized Training Fork ☆33 Updated 6 months ago
- High quality text-to-speech based on StyleTTS 2. ☆60 Updated last week
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆119 Updated last month
- ☆270 Updated last month
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion