UtaUtaUtau / diff-svcLinks
Singing Voice Conversion via diffusion model
☆32Updated 2 years ago
Alternatives and similar repositories for diff-svc
Users that are interested in diff-svc are comparing it to the libraries listed below
Sorting:
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆40Updated 2 years ago
- Singing Voice Conversion via diffusion model☆58Updated 2 years ago
- VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)☆38Updated last year
- Few-shot multilingual tts with RVC and Vits☆51Updated 2 years ago
- Drop-and-run script for Automatic1111's Stable Diffusion WebUI☆32Updated last year
- Voice model "LIEE" for DIFF-SVC by julieraptor☆34Updated 8 months ago
- Launcher Automatic1111's Stable Diffusion WebUI☆13Updated last year
- Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS☆64Updated 3 years ago
- vits2 backbone with multilingual-bert(한국어 지원)☆27Updated last year
- The better web ui for MOE-TTS☆24Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆10Updated 6 years ago
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆35Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆28Updated 3 years ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆13Updated 2 years ago
- ☆86Updated 2 years ago
- ddetailer + sd-upscaler script☆69Updated 2 years ago
- ☆22Updated last year
- ☆113Updated 6 months ago
- Talking Head(?) Anime from a Single Image 4: Improved Model and Its Distillation☆60Updated 10 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…☆69Updated 2 years ago
- Download scripts for NAVER Webtoon images☆60Updated 4 years ago
- Yet another anime face detector based on yolov5.☆97Updated last year
- ☆26Updated 2 years ago
- use images to seed video generation☆21Updated 2 years ago
- VALL-E 한국어 버전☆12Updated 2 years ago
- badly coded gui for a quick streamlined workflow to produce 512x512 images suitable to train Stable Diffusion☆31Updated 2 years ago
- AI video temporal coherence Lab☆56Updated 2 years ago
- Korean language support for NNSVS/ENUNU☆28Updated last year
- ☆43Updated 2 years ago
- Open-Free TTS Platform For All☆12Updated 9 months ago