MoonInTheRiver / DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆4,258Updated last year
Related projects: ⓘ
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆2,674Updated this week
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆6,675Updated 9 months ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,696Updated 2 months ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,613Updated 4 months ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆1,803Updated 3 weeks ago
- vits2 backbone with multilingual-bert☆7,793Updated this week
- ☆3,410Updated this week
- Singing Voice Conversion via diffusion model☆2,621Updated last year
- 无需情感标注的情感可控语音合成模型,基于VITS☆1,307Updated last year
- SoftVC VITS Singing Voice Conversion☆25,358Updated 10 months ago
- so-vits-svc fork with realtime support, improved interface and more features.☆8,674Updated this week
- A simple GUI application that slices audio with silence detection☆1,198Updated last month
- Executable file for VITS inference☆2,327Updated last year
- LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.☆4,388Updated this week
- 多个SVC/TTS的C++推理库☆984Updated last month
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,726Updated last month
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆908Updated 9 months ago
- tha3, but run 40fps on 3080 with virtural webcam support☆1,838Updated 2 months ago
- Python script that slices audio with silence detection☆754Updated 3 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆11,657Updated 2 months ago
- Yet another voice assistant, but alive.☆2,425Updated 9 months ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆10,928Updated this week
- Easily train a good VC model with voice data <= 10 mins!☆22,944Updated 2 weeks ago
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆1,992Updated 10 months ago
- Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0☆4,708Updated last month
- So-VITS-SVC 本地部署/训练/推理/使用帮助文档 So-VITS-SVC Local Deployment/Training/Inference/Usage Help Document☆656Updated last month
- WebUI extension for ControlNet☆16,825Updated last month
- An easy to understand TTS / SVS / SVC framework☆623Updated last month
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆968Updated last year
- ☆5,597Updated last year