Plachtaa / seed-vc
zero-shot voice conversion & singing voice conversion, with real-time support
☆2,453Updated 3 weeks ago
Alternatives and similar repositories for seed-vc
Users that are interested in seed-vc are comparing it to the libraries listed below
Sorting:
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System☆1,716Updated this week
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆2,756Updated 2 weeks ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆1,603Updated last week
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆1,086Updated last week
- An Open-Sourced LLM-empowered Foundation TTS System☆698Updated last month
- Taming Stable Diffusion for Lip Sync!☆3,968Updated last week
- ☆1,308Updated 11 months ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,501Updated last week
- Multilingual Voice Understanding Model☆5,593Updated last month
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆3,740Updated 2 months ago
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆1,216Updated last week
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆2,690Updated last week
- Interface for OuteTTS models.☆1,227Updated 2 weeks ago
- ☆4,280Updated 2 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆1,766Updated this week
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆4,146Updated 3 weeks ago
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆3,845Updated 5 months ago
- TTS with kokoro and onnx runtime☆1,975Updated last week
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆377Updated 2 weeks ago
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.☆4,551Updated 2 months ago
- 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”☆2,395Updated 2 months ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,573Updated 9 months ago
- Real time interactive streaming digital human☆5,537Updated 2 weeks ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13,724Updated last week
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆660Updated 10 months ago
- Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…☆2,647Updated 2 months ago
- https://hf.co/hexgrad/Kokoro-82M☆2,777Updated 2 weeks ago
- 一个超轻量级、可以在移动端实时运行的数字人模型☆1,889Updated 2 months ago
- ☆5,266Updated last week
- CosyVoice在Windows环境下使用的版本☆677Updated 5 months ago