RVC-Project / Retrieval-based-Voice-Conversion-WebUILinks
Easily train a good VC model with voice data <= 10 mins!
☆30,118Updated 6 months ago
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-WebUI
Users that are interested in Retrieval-based-Voice-Conversion-WebUI are comparing it to the libraries listed below
Sorting:
- SoftVC VITS Singing Voice Conversion☆27,237Updated last year
- リアルタイムボイスチェンジャー Realtime Voice Changer☆18,209Updated last month
- so-vits-svc fork with realtime support, improved interface and more features.☆9,036Updated last week
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,804Updated last year
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,930Updated 5 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆47,858Updated this week
- vits2 backbone with multilingual-bert☆8,468Updated last week
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆2,213Updated 3 weeks ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,496Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks.☆20,962Updated 3 months ago
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆14,629Updated last week
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆2,899Updated this week
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,522Updated 3 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,890Updated 11 months ago
- 🔊 Text-Prompted Generative Audio Model☆38,031Updated 10 months ago
- Let us control diffusion models!☆32,553Updated last year
- WebUI extension for ControlNet☆17,678Updated 10 months ago
- SOTA Open Source TTS☆21,914Updated last week
- Stable Diffusion web UI☆153,685Updated last month
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆79,795Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆40,844Updated 10 months ago
- *CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other …☆1,149Updated last year
- Industry leading face manipulation platform☆23,392Updated this week
- Bring portraits to life!☆16,277Updated last week
- Official implementation of AnimateDiff.☆11,502Updated 10 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,078Updated 10 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆41,141Updated 8 months ago
- A simple GUI application that slices audio with silence detection☆1,359Updated 10 months ago
- Official Code for DragGAN (SIGGRAPH 2023)☆35,915Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆14,672Updated last week