devidw / dswav
Tooling to build datasets for audio model training
☆16, updated 2 years ago
Alternatives and similar repositories for dswav
Users that are interested in dswav are comparing it to the libraries listed below
- Text-prompted Generative Audio Model - With the ability to clone voices (☆20, updated 2 years ago)
- Faster Tortoise inference than Tortoise Fast Fork (☆127, updated last year)
- Create training data for training a voice cloner for bark text to speech. (☆48, updated 2 years ago)
- Official implementation of the TTS model Lina-Speech (☆176, updated last year)
- Barkify: an unofficial training implementation of Bark TTS by suno-ai (☆129, updated 2 years ago)
- TorToiSe fine-tuning with DLAS (☆226, updated last year)
- audiolm-pytorch training code (☆15, updated 2 years ago)
- An unofficial PyTorch implementation of VALL-E (☆88, updated 6 months ago)
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io (☆69, updated 2 years ago)
- Your one-stop solution for voice dataset creation (☆128, updated 2 years ago)
- Google's SoundStorm: Efficient Parallel Audio Generation (☆131, updated 2 years ago)
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei (☆162, updated 2 years ago)
- Application of MB-iSTFT-VITS components to vits2_pytorch (☆131, updated last month)
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion (☆187, updated last year)
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… (☆106, updated last year)
- The reproduced code for Google's SoundStorm (☆270, updated 2 years ago)
- (☆258, updated last year)
- Train the next generation of TTS systems. (☆170, updated last year)
- Official Implementation of StyleTTS-VC (☆196, updated last year)
- (☆71, updated 2 years ago)
- [WIP] VoiceSmith makes training text to speech models easy. (☆228, updated 3 years ago)
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning (☆161, updated last year)
- In this repository I will be running various experiments on fine-tuning different parts of XTTS (☆15, updated last year)
- VALL-E 2 reproduction (☆134, updated last year)
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. (☆258, updated last year)
- Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets (☆135, updated 5 months ago)
- This is an implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. (☆86, updated last year)
- (WIP) A retrain of F5-TTS on permissively-licensed data (☆13, updated 9 months ago)
- RVC Onnx Infer - Upgraded and simplified-ish (☆25, updated last year)
- ChatTTS is a generative speech model for daily dialogue. (☆23, updated last year)