devidw / dswavLinks
Tooling to build datasets for audio model training
ā16Updated last year
Alternatives and similar repositories for dswav
Users that are interested in dswav are comparing it to the libraries listed below
Sorting:
- š Text-prompted Generative Audio Model - With the ability to clone voicesā20Updated 2 years ago
- Faster Tortoise inference then Tortoise Fast Forkā126Updated last year
- An unofficial PyTorch implementation of VALL-Eā87Updated this week
- Application of MB-iSTFT-VITS components to vits2_pytorchā126Updated 6 months ago
- Misc. tools/scripts that I made to use for tortoiseā21Updated 9 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioā67Updated last year
- Train the next generation of TTS systems.ā165Updated 8 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā98Updated 7 months ago
- audiolm-pytorch training codeā15Updated last year
- Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.ā15Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWeiā146Updated last year
- StyleTTS 2 Optimized Training Forkā29Updated 4 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionā179Updated 8 months ago
- TorToiSe fine-tuning with DLASā220Updated 10 months ago
- Official implementation of the TTS model Lina-Speechā164Updated 4 months ago
- ā71Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generationā132Updated last year
- Create training data for training a voice cloner for bark text to speech.ā45Updated last year
- ā257Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speechā236Updated last year
- Your one-stop solution for voice dataset creationā119Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)ā119Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Trainingā127Updated 2 years ago
- ā229Updated 2 months ago
- openvino version of openai/whisperā166Updated last year
- ChatTTS is a generative speech model for daily dialogue.ā22Updated 4 months ago
- a Frontier Japanese Speech Generation netā39Updated 2 weeks ago
- ā140Updated last year
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā160Updated 10 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.ā112Updated last year