IIEleven11 / Automatic-Audio-Dataset-MakerLinks
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
β46Updated 3 months ago
Alternatives and similar repositories for Automatic-Audio-Dataset-Maker
Users that are interested in Automatic-Audio-Dataset-Maker are comparing it to the libraries listed below
Sorting:
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β131Updated 4 months ago
- SoTA open-source TTSβ122Updated 6 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.β127Updated 5 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.β86Updated last year
- β290Updated 5 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionβ187Updated last year
- β52Updated last week
- High quality text-to-speech based on StyleTTS 2.β71Updated last week
- StyleTTS 2 Optimized Training Forkβ33Updated 10 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β256Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.