RVC-Project / Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
☆27,116Updated 2 months ago
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-WebUI:
Users that are interested in Retrieval-based-Voice-Conversion-WebUI are comparing it to the libraries listed below
- SoftVC VITS Singing Voice Conversion☆26,571Updated last year
- リアルタイムボイスチェンジャー Realtime Voice Changer☆17,215Updated this week
- so-vits-svc fork with realtime support, improved interface and more features.☆8,897Updated this week
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆40,898Updated this week
- vits2 backbone with multilingual-bert☆8,259Updated last week
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,827Updated last month
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,158Updated last year
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,731Updated 9 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks.☆19,532Updated 2 months ago
- WebUI extension for ControlNet☆17,371Updated 6 months ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆2,017Updated last week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆67,987Updated this week
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,341Updated 7 months ago
- Let us control diffusion models!☆31,490Updated 11 months ago
- Real-time face swap for PC streaming or video calls☆27,639Updated 3 months ago
- stable diffusion webui colab☆15,748Updated 4 months ago
- Community interface for generative AI☆8,930Updated 9 months ago
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,838Updated this week
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,881Updated 6 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆40,150Updated 4 months ago
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆13,616Updated this week
- ☆10,149Updated 3 weeks ago
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model☆8,926Updated 6 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆37,869Updated 6 months ago
- 🔊 Text-Prompted Generative Audio Model☆36,988Updated 6 months ago
- Industry leading face manipulation platform☆21,538Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆30,988Updated last month
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,406Updated last year
- one-click face swap☆29,322Updated 6 months ago
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆2,787Updated this week