devidw / dswav
Tooling to build datasets for audio model training
☆16Updated last year
Alternatives and similar repositories for dswav:
Users that are interested in dswav are comparing it to the libraries listed below
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated last year
- ☆206Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E☆88Updated this week
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated 10 months ago
- Official implementation of the TTS model Lina-Speech☆157Updated 2 months ago
- a Frontier Japanese Speech Generation net☆26Updated this week
- Train the next generation of TTS systems.☆163Updated 6 months ago
- audiolm-pytorch training code☆15Updated last year
- Create training data for training a voice cloner for bark text to speech.☆43Updated last year
- TorToiSe fine-tuning with DLAS☆218Updated 7 months ago
- Running the F5-TTS by ONNX Runtime☆115Updated last week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆173Updated 5 months ago
- ☆62Updated 7 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆141Updated last year
- Community framework for training tortoise☆40Updated 2 years ago
- Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.☆15Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated last year
- ☆95Updated 10 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆268Updated last year
- Your one-stop solution for voice dataset creation☆117Updated last year
- Implementation of SoundStorm built upon SpeechTokenizer.☆108Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆29Updated last week
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆128Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆205Updated 10 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆154Updated 8 months ago
- ☆253Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆123Updated 3 months ago
- ☆155Updated 2 months ago
- The reproduced code for Google's SoundStorm☆263Updated last year