devidw / dswav
Tooling to build datasets for audio model training
β16Updated last year
Alternatives and similar repositories for dswav:
Users that are interested in dswav are comparing it to the libraries listed below
- audiolm-pytorch training codeβ15Updated last year
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ20Updated last year
- An unofficial PyTorch implementation of VALL-Eβ87Updated this week
- β29Updated 2 weeks ago
- Running the F5-TTS by ONNX Runtimeβ91Updated this week
- Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.β15Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioβ67Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationβ114Updated 2 weeks ago
- Create training data for training a voice cloner for bark text to speech.β43Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generationβ130Updated last year
- Train the next generation of TTS systems.β162Updated 4 months ago
- Official implementation of the TTS model Lina-Speechβ150Updated 3 weeks ago
- Faster Tortoise inference then Tortoise Fast Forkβ126Updated 9 months ago
- β62Updated 6 months ago
- β70Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β90Updated 3 months ago
- F5-TTS ζ¨ηε ιοΌιεΊ¦ζεηΊ¦4εοΌβ24Updated 3 weeks ago
- β33Updated last year
- All generative model in one for better TTS modelβ66Updated 4 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ126Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorchβ121Updated 2 months ago
- β195Updated 3 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speakeβ¦β57Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesisβ126Updated 2 weeks ago
- β26Updated 10 months ago
- Community framework for training tortoiseβ40Updated 2 years ago
- β28Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionβ169Updated 4 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWeiβ139Updated last year
- The reproduced code for Google's SoundStormβ262Updated last year