π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.
β260Jun 10, 2024Updated 2 years ago
Alternatives and similar repositories for speech-dataset-generator
Users that are interested in speech-dataset-generator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- β19Mar 22, 2024Updated 2 years ago
- Fine Tune the Style-TTS2 Voice Modelβ265Jun 17, 2025Updated 11 months ago
- The open source code for SimpleSpeech seriesβ146Oct 8, 2024Updated last year
- All generative model in one for better TTS modelβ74Sep 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reference-aware automatic speech evaluation toolkitβ183Dec 5, 2024Updated last year
- Easy-to-Use Speech MOS predictorsβ356Oct 24, 2023Updated 2 years ago
- unofficial vits2-TTS implementation in pytorchβ549Mar 28, 2024Updated 2 years ago
- A pitch detection model trained to be robust against noise and reverberation environments.β27Jan 21, 2025Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversionβ112Apr 1, 2024Updated 2 years ago
- β25Mar 6, 2024Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorchβ134Dec 29, 2025Updated 5 months ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioningβ161Jun 13, 2024Updated last year
- 60k hours of phoneme-aligned audio from audio booksβ19Jul 27, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.β49Sep 15, 2025Updated 8 months ago
- text to speech using autoregressive transformer and VITSβ248Apr 3, 2024Updated 2 years ago
- β151Apr 25, 2025Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turboβ223Apr 14, 2025Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)