RVC-Project / Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
☆26,090Updated last month
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-WebUI:
Users that are interested in Retrieval-based-Voice-Conversion-WebUI are comparing it to the libraries listed below
- リアルタイムボイスチェンジャー Realtime Voice Changer☆16,933Updated 2 months ago
- SoftVC VITS Singing Voice Conversion☆26,318Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆38,875Updated 2 weeks ago
- so-vits-svc fork with realtime support, improved interface and more features.☆8,848Updated this week
- GUI for a Vocal Remover that uses Deep Neural Networks.☆19,040Updated last month
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,800Updated 6 months ago
- vits2 backbone with multilingual-bert☆8,183Updated this week
- Stable Diffusion web UI☆145,963Updated 2 weeks ago
- WebUI extension for ControlNet☆17,277Updated 5 months ago
- 🔊 Text-Prompted Generative Audio Model☆36,678Updated 4 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆63,655Updated this week
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,042Updated last year
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,216Updated 6 months ago
- Let us control diffusion models!☆31,207Updated 10 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆41,616Updated this week
- High-Resolution Image Synthesis with Latent Diffusion Models☆39,774Updated 3 months ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,713Updated 8 months ago
- ☆9,958Updated 2 weeks ago
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,568Updated 5 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆21,325Updated this week
- SOTA Open Source TTS☆18,396Updated this week
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,378Updated last year
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆1,958Updated this week
- Official implementation of AnimateDiff.☆10,863Updated 5 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆36,915Updated 5 months ago
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,820Updated 5 months ago
- Industry leading face manipulation platform☆20,969Updated this week
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆24,165Updated this week
- stable diffusion webui colab☆15,700Updated 3 months ago
- A generative speech model for daily dialogue.☆33,664Updated this week