A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
☆331Mar 1, 2026Updated this week
Alternatives and similar repositories for Voice-Clone-Studio
Users that are interested in Voice-Clone-Studio are comparing it to the libraries listed below
Sorting:
- This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.☆103Jan 25, 2026Updated last month
- ☆17Feb 4, 2026Updated last month
- A comprehensive AI-powered video production studio. Features local batch processing for automated dubbing (XTTS), smart audio censorship …☆40Updated this week
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago
- Just a script i use for training YOLOs with most of parameters exposed and described.☆13Dec 13, 2024Updated last year
- Frontend (and soon also midleware and backend) for a new, opensource image generation platform.☆14Nov 5, 2022Updated 3 years ago
- Command palette with XState☆13Jan 6, 2023Updated 3 years ago
- ☆64Dec 16, 2025Updated 2 months ago
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆60Updated this week
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution☆58Oct 13, 2025Updated 4 months ago
- ☆28Aug 7, 2025Updated 7 months ago
- ☆21Feb 13, 2025Updated last year
- run home assistant rootless☆22Feb 21, 2026Updated 2 weeks ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆307Dec 15, 2025Updated 2 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 4 months ago
- ☆98Oct 17, 2025Updated 4 months ago
- ☆150Updated this week
- WebAssembly SQLite with support for browser storage extensions☆23Aug 28, 2025Updated 6 months ago
- Pulstack – CLI tool for Instant Static Site Deployment with Pulumi☆25Apr 8, 2025Updated 10 months ago
- Improved qwen image editing accuracy☆34Dec 2, 2025Updated 3 months ago
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆107Feb 21, 2026Updated last week
- ☆96Dec 28, 2025Updated 2 months ago
- ☆22Apr 23, 2024Updated last year
- ☆20Sep 20, 2022Updated 3 years ago
- Make open source contribution less intimidating. Analyze repos before you contribute.☆108Feb 18, 2026Updated 2 weeks ago
- xstate form integration☆22Jan 19, 2021Updated 5 years ago
- Some music tools in ComfyUI☆123Dec 9, 2025Updated 2 months ago
- ☆115Dec 28, 2025Updated 2 months ago
- [ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models☆132Dec 25, 2025Updated 2 months ago
- SkyReels V3: Multimodal Video Generation Model☆329Jan 30, 2026Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- IntrinsiX: High-Quality PBR Generation using Image Priors☆52Dec 8, 2025Updated 2 months ago
- This is system where images are trained and recognize of bumch of faces at a time☆23Oct 25, 2025Updated 4 months ago
- Coordinated Agent Team is a prompt-driven multi-agent system for autonomous software delivery. It defines clear agent roles, a determinis…☆30Feb 19, 2026Updated 2 weeks ago
- Some tools for 3D Geometry processing in ComfyUI. igl, CGAL, blender...☆154Updated this week
- ☆83Jan 25, 2026Updated last month
- Official implementation of Video-DPM☆173Jan 19, 2026Updated last month
- ☆72Updated this week
- Read, modify and write DICOS files with python code☆13Nov 24, 2025Updated 3 months ago