voicepaw / so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
☆8,848 Updated this week
Alternatives and similar repositories for so-vits-svc-fork:
Users who are interested in so-vits-svc-fork are comparing it to the libraries listed below.
- SoftVC VITS Singing Voice Conversion ☆26,318 Updated last year
- Core Engine of Singing Voice Conversion & Singing Voice Clone ☆2,713 Updated 8 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆7,042 Updated last year
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion ☆4,800 Updated 6 months ago
- A simple GUI application that slices audio with silence detection ☆1,286 Updated 5 months ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing) ☆1,958 Updated this week
- Easily train a good VC model with voice data <= 10 mins! ☆26,090 Updated last month
- Realtime Voice Changer ☆16,933 Updated 2 months ago
- vits2 backbone with multilingual-bert ☆8,183 Updated this week
- Singing Voice Conversion via diffusion model ☆2,662 Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks. ☆19,040 Updated last month
- WebUI extension for ControlNet ☆17,277 Updated 5 months ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆4,378 Updated last year
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆12,216 Updated 6 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ ☆7,751 Updated 11 months ago
- Bark Voice Cloning and Voice Cloning for Chinese Speech ☆2,820 Updated 5 months ago
- Executable file for VITS inference ☆2,360 Updated last year
- 🔊 Text-Prompted Generative Audio Model ☆36,678 Updated 4 months ago
- C++ inference library for multiple SVC/TTS models ☆1,030 Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆21,325 Updated this week
- fast-stable-diffusion + DreamBooth ☆7,609 Updated 2 weeks ago
- A repository of models, textual inversions, and more ☆6,335 Updated this week
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model ☆8,759 Updated 5 months ago
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi… ☆2,762 Updated this week
- Python script that slices audio with silence detection ☆794 Updated 7 months ago
- SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model. ☆4,825 Updated this week
- Official implementation of AnimateDiff. ☆10,863 Updated 5 months ago
- stable diffusion webui colab ☆15,700 Updated 3 months ago
- Faster Whisper transcription with CTranslate2 ☆13,490 Updated 2 weeks ago
- roop extension for StableDiffusion web-ui ☆3,427 Updated 9 months ago