RVC-Project / Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
☆29,415Updated 5 months ago
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-WebUI
Users that are interested in Retrieval-based-Voice-Conversion-WebUI are comparing it to the libraries listed below
Sorting:
- リアルタイムボイスチェンジャー Realtime Voice Changer☆17,992Updated this week
- SoftVC VITS Singing Voice Conversion☆27,086Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆46,470Updated 3 weeks ago
- vits2 backbone with multilingual-bert☆8,415Updated this week
- so-vits-svc fork with realtime support, improved interface and more features.☆9,003Updated this week
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,904Updated 4 months ago
- SOTA Open Source TTS☆21,107Updated last month
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,432Updated last year
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,731Updated 10 months ago
- Stable Diffusion web UI☆152,545Updated 2 weeks ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆2,160Updated 3 weeks ago
- 🔊 Text-Prompted Generative Audio Model☆37,838Updated 9 months ago
- WebUI extension for ControlNet☆17,621Updated 9 months ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,789Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks.☆20,665Updated 2 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆77,172Updated this week
- Industry leading face manipulation platform☆23,003Updated this week
- LLM Frontend for Power Users.☆14,493Updated this week
- A generative speech model for daily dialogue.☆36,249Updated 2 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆40,062Updated 9 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆43,630Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆32,250Updated last month
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,868Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆25,120Updated this week
- High-Resolution Image Synthesis with Latent Diffusion Models☆40,981Updated 7 months ago
- A latent text-to-image diffusion model☆70,677Updated 11 months ago
- ☆10,663Updated this week
- Real-time face swap for PC streaming or video calls☆28,536Updated 6 months ago
- A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large …☆5,832Updated last month
- A simple GUI application that slices audio with silence detection☆1,348Updated 9 months ago