voicepaw / so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
☆8,894 Updated this week
Alternatives and similar repositories for so-vits-svc-fork:
Users who are interested in so-vits-svc-fork are comparing it to the libraries listed below.
- SoftVC VITS Singing Voice Conversion ☆26,550 Updated last year
- Core Engine of Singing Voice Conversion & Singing Voice Clone ☆2,731 Updated 9 months ago
- Easily train a good VC model with voice data <= 10 mins! ☆27,116 Updated 2 months ago
- Realtime Voice Changer ☆17,199 Updated this week
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆7,151 Updated last year
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing) ☆2,015 Updated last week
- Singing Voice Conversion via diffusion model ☆2,667 Updated last year
- A simple GUI application that slices audio with silence detection ☆1,302 Updated 6 months ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion ☆4,827 Updated last month
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆4,406 Updated last year
- C++ inference library for multiple SVC/TTS models ☆1,044 Updated 4 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks. ☆19,532 Updated 2 months ago
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi… ☆2,784 Updated this week
- A multi-voice TTS system trained with an emphasis on quality ☆13,707 Updated 3 months ago
- 🔊 Text-Prompted Generative Audio Model ☆36,988 Updated 6 months ago
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆12,341 Updated 7 months ago
- Executable file for VITS inference ☆2,366 Updated last year
- A simple, high-quality voice conversion tool focused on ease of use and performance. ☆2,114 Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ ☆7,788 Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆37,794 Updated 6 months ago
- ☆10,131 Updated 2 weeks ago
- WebUI extension for ControlNet ☆17,371 Updated 6 months ago
- Python script that slices audio with silence detection ☆801 Updated 8 months ago
- *CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other … ☆1,097 Updated last year
- vits2 backbone with multilingual-bert ☆8,248 Updated last week
- Official implementation of AnimateDiff. ☆11,019 Updated 6 months ago
- ☆7,751 Updated 10 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆3,245 Updated 8 months ago
- Stable diffusion for real-time music generation ☆3,538 Updated 6 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,542 Updated 10 months ago