rioharper / VoiceDatasetCreation
★18 · Updated 3 years ago
Alternatives and similar repositories for VoiceDatasetCreation
Users that are interested in VoiceDatasetCreation are comparing it to the libraries listed below
- Faster Tortoise inference than Tortoise Fast Fork ★128 · Updated last year
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★159 · Updated last year
- ★107 · Updated last year
- Audio datasets, easier. ★84 · Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ★10 · Updated 10 months ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ★52 · Updated 3 years ago
- Your one-stop solution for voice dataset creation ★123 · Updated last year
- Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ★15 · Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ★47 · Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ★132 · Updated 2 years ago
- A fast MP3 decoder for python, using minimp3 ★29 · Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ★71 · Updated 2 years ago
- DLAS - A configuration-driven trainer for generative models ★139 · Updated 2 years ago
- text-to-audio-latent-diffusion ★37 · Updated 2 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ★68 · Updated 3 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ★26 · Updated 2 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental! ★43 · Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech. ★46 · Updated 2 years ago
- Community framework for training tortoise ★43 · Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ★34 · Updated last year
- an improved version of Real-time-voice-cloning ★50 · Updated last year
- ★62 · Updated last year
- The demo page of UniAudio ★34 · Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion ★84 · Updated 2 years ago
- Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ★66 · Updated 2 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model ★61 · Updated 2 years ago
- TorToiSe fine-tuning with DLAS ★224 · Updated last year
- AudioLDM text to audio colab ★19 · Updated last year
- Text prompt steered synthetic audio generators ★49 · Updated 4 months ago
- ★262 · Updated last year