devidw / dswav
Tooling to build datasets for audio model training
β16Updated last year
Alternatives and similar repositories for dswav:
Users that are interested in dswav are comparing it to the libraries listed below
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ20Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWeiβ144Updated last year
- a Frontier Japanese Speech Generation netβ32Updated last month
- Train the next generation of TTS systems.β165Updated 7 months ago
- Official implementation of the TTS model Lina-Speechβ164Updated 3 months ago
- Faster Tortoise inference then Tortoise Fast Forkβ128Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β95Updated 6 months ago
- An unofficial PyTorch implementation of VALL-Eβ87Updated 2 weeks ago
- Official Implementation of StyleTTS-VCβ178Updated 3 months ago
- β255Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionβ176Updated 7 months ago
- Your one-stop solution for voice dataset creationβ119Updated last year
- β71Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-Eβ138Updated 6 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioβ67Updated last year
- F5-TTS ζ¨ηε ιοΌιεΊ¦ζεηΊ¦4εοΌβ80Updated 4 months ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speechβ236Updated last year
- β62Updated 9 months ago
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Modelβ207Updated last year
- ChatTTS is a generative speech model for daily dialogue.β22Updated 3 months ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.β162Updated 11 months ago
- The reproduced code for Google's SoundStormβ265Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorchβ126Updated 5 months ago
- β33Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesisβ262Updated last month
- YuE with mp3 extend, exllama and GUIβ48Updated 2 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)β119Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ129Updated last year
- Create training data for training a voice cloner for bark text to speech.β44Updated last year
- All generative model in one for better TTS modelβ67Updated 7 months ago