svc-develop-team / so-vits-svc
SoftVC VITS Singing Voice Conversion
☆26,807Updated last year
Alternatives and similar repositories for so-vits-svc:
Users that are interested in so-vits-svc are comparing it to the libraries listed below
- so-vits-svc fork with realtime support, improved interface and more features.☆8,946Updated last week
- Easily train a good VC model with voice data <= 10 mins!☆28,227Updated 4 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,279Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆43,162Updated this week
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,744Updated 11 months ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,864Updated 2 months ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,431Updated last week
- vits2 backbone with multilingual-bert☆8,345Updated this week
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆2,817Updated last week
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆2,085Updated last month
- 基于vits与softvc的歌声音色转换模型☆3,679Updated 5 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks.☆20,004Updated 2 weeks ago
- リアルタイムボイスチェンジャー Realtime Voice Changer☆17,603Updated last month
- stable diffusion webui colab☆15,803Updated 5 months ago
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆33,694Updated last week
- A simple GUI application that slices audio with silence detection☆1,324Updated 8 months ago
- 🔊 Text-Prompted Generative Audio Model☆37,321Updated 7 months ago
- Singing Voice Conversion via diffusion model☆2,668Updated last year
- A generative speech model for daily dialogue.☆35,409Updated 2 weeks ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,510Updated 9 months ago
- Executable file for VITS inference☆2,376Updated last year
- WebUI extension for ControlNet☆17,466Updated 7 months ago
- 🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time☆36,042Updated 4 months ago
- Reverse engineered ChatGPT API☆28,056Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,616Updated 6 months ago
- Stable Diffusion web UI☆150,015Updated 3 weeks ago
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆14,007Updated this week
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model☆9,086Updated 7 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆40,628Updated 5 months ago
- Let us control diffusion models!☆31,864Updated last year