UtaUtaUtau / diff-svc
Singing Voice Conversion via diffusion model
☆33 · Updated last year
Alternatives and similar repositories for diff-svc:
Users who are interested in diff-svc are comparing it to the repositories listed below
- Singing Voice Conversion via diffusion model ☆57 · Updated last year
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing) ☆39 · Updated last year
- Voice model "LIEE" for DIFF-SVC by julieraptor ☆34 · Updated last year
- Few-shot multilingual tts with RVC and Vits ☆50 · Updated last year
- a ttkbootstrap gui tool for diff-svc ☆12 · Updated 2 years ago
- vits2 backbone with multilingual-bert (Korean language support) ☆25 · Updated 9 months ago
- Drop-and-run script for Automatic1111's Stable Diffusion WebUI ☆33 · Updated last year
- VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference) ☆38 · Updated 10 months ago
- Korean TTS using coqui TTS (glowtts and multiband melgan) ☆56 · Updated 2 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin… ☆65 · Updated 2 years ago
- Korean language support for NNSVS/ENUNU ☆27 · Updated 9 months ago
- Launcher for Automatic1111's Stable Diffusion WebUI ☆13 · Updated last year
- Open-Free TTS Platform For All ☆12 · Updated 3 weeks ago
- badly coded gui for a quick streamlined workflow to produce 512x512 images suitable to train Stable Diffusion ☆31 · Updated 2 years ago
- A ChatWaifu Version with official ChatGPT API ☆18 · Updated last year
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai ☆34 · Updated 10 months ago
- ☆23 · Updated 5 months ago
- ddetailer + sd-upscaler script ☆70 · Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆25 · Updated 2 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆12 · Updated last year
- Controllable Toonification using StyleGAN2 (creating a Disney cartoon style with StyleGAN2) ☆11 · Updated 2 years ago
- Extension program for DIFF-SVC to make it easier to use ☆17 · Updated 2 years ago
- Tweaked version of Mangio's fork of the Retrieval-based-Voice-Conversion WebUI ☆10 · Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ☆65 · Updated last year
- Speech AI training and inference tools ☆37 · Updated last year
- The better web ui for MOE-TTS ☆23 · Updated last year
- A python package for midi clip ✂️ ☆18 · Updated last year
- Download scripts for NAVER Webtoon images ☆60 · Updated 3 years ago