FranckyB/Voice-Clone-Studio

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FranckyB/Voice-Clone-Studio)

FranckyB / Voice-Clone-Studio

A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.

☆508

Alternatives and similar repositories for Voice-Clone-Studio

Users that are interested in Voice-Clone-Studio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WhiskeyCoder / Qwen3-Audiobook-Converter
View on GitHub
Convert PDFs, EPUBs, DOCX, DOC, and TXT files into high-quality audiobooks using **Qwen3 TTS Voice Model** - an open-source voice synthes…
☆868Apr 7, 2026Updated 3 months ago
filliptm / ComfyUI-FL-Qwen3TTS
View on GitHub
Qwen3-TTS text-to-speech nodes for ComfyUI with voice cloning, voice design, and fine-tuning UI
☆135Apr 25, 2026Updated 3 months ago
DarioFT / ComfyUI-Qwen3-TTS
View on GitHub
A ComfyUI custom node suite for Qwen3-TTS, supporting 1.7B and 0.6B models, Custom Voice, Voice Design, Voice Cloning and Fine-Tuning.
☆288Feb 9, 2026Updated 5 months ago
1038lab / ComfyUI-QwenASR
View on GitHub
A lightweight ComfyUI custom node pack for Qwen3-ASR, providing simple speech‑to‑text workflows with local model caching and optional tim…
☆64Jan 31, 2026Updated 5 months ago
1038lab / ComfyUI-QwenTTS
View on GitHub
ComfyUI custom nodes for speech, voice cloning, and voice design based on Qwen3-TTS models
☆227Jan 30, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
FranckyB / ComfyUI-Prompt-Manager
View on GitHub
Prompt Manager for ComfyUI, with integration with llama.cpp for prompt generation. Allowing users to generate and save prompts, as well a…
☆134Updated this week
voicepowered-ai / VibeVoice-finetuning
View on GitHub
Unofficial WIP LoRa Finetuning repository for VibeVoice
☆371Sep 24, 2025Updated 10 months ago
Starnodes2024 / Qwen-Voice-TTS-Studio
View on GitHub
Easy to use GUI for Qwen TTS 3 for voice creating and cloning
☆30Jan 30, 2026Updated 5 months ago
ysharma3501 / LuxTTS
View on GitHub
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
☆4,869Jun 5, 2026Updated last month
flybirdxx / ComfyUI-Qwen-TTS
View on GitHub
A Simple Implementation of Qwen3-TTS's ComfyUI
☆1,811Jun 3, 2026Updated last month
diodiogod / TTS-Audio-Suite
View on GitHub
A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwe…
☆1,128Updated this week
DarioFT / ComfyUI-Qwen3-ASR
View on GitHub
ComfyUI custom nodes for Qwen3-ASR (Automatic Speech Recognition) - audio-to-text transcription supporting 52 languages and dialects.
☆190Jan 29, 2026Updated 6 months ago
BuffaloBuffaloBuffaloBuffalo / ai-toolkit-perceptual
View on GitHub
☆158Jun 14, 2026Updated last month
MarzEnt87 / ComfyUI-Workflows
View on GitHub
Just a series of workflows I
☆34Sep 10, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JarodMica / dataset-maker
View on GitHub
☆34Jan 25, 2026Updated 6 months ago
Saganaki22 / ComfyUI-AudioSR
View on GitHub
ComfyUI node for AudioSR - Versatile Audio Super Resolution upscales audio to 48kHz using latent diffusion
☆92Feb 12, 2026Updated 5 months ago
Saganaki22 / ComfyUI-ytdl_nodes
View on GitHub
Custom ComfyUI nodes for downloading, converting, and previewing audio/video from YouTube and 1,000+ other platforms
☆32Sep 6, 2025Updated 10 months ago
bc-dunia / qwen3-TTS-studio
View on GitHub
A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workf…
☆285Mar 30, 2026Updated 4 months ago
dasjoms / ChatterboxToolkitUI
View on GitHub
A comprehensive WebUI Toolkit for Resemble-AI's Chatterbox
☆28Jun 7, 2025Updated last year
andimarafioti / faster-qwen3-tts
View on GitHub
Real-time text-to-speech with Qwen3-TTS
☆1,261Jul 17, 2026Updated last week
QwenLM / Qwen3-TTS
View on GitHub
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆12,672Mar 17, 2026Updated 4 months ago
Enemyx-net / VibeVoice-ComfyUI
View on GitHub
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice …
☆1,521Feb 18, 2026Updated 5 months ago
LuqP2 / Image-MetaHub
View on GitHub
A desktop application for browsing, searching, and organizing AI-generated images locally. Designed for performance with large collection…
☆307Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jaskirat05 / Graviton
View on GitHub
Graviton: Daisy-Chain ComfyUI workflows. Distribute among multiple GPUs
☆44Mar 2, 2026Updated 4 months ago
cyberbol / AI-Video-Clipper-LoRA
View on GitHub
Automated video dataset creator for Windows using WhisperX and Qwen2-VL
☆18Updated this week
vanilsotae / vanilsotae
View on GitHub
♡
☆22Updated this week
Saganaki22 / ComfyUI-OmniVoice-TTS
View on GitHub
OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue
☆521Jun 11, 2026Updated last month
FranckyB / ComfyUI-DramaBox
View on GitHub
Port of resemble-ai's DramaBox for ComfyUI
☆44May 20, 2026Updated 2 months ago
AndyLone22 / MirrorMetrics
View on GitHub
MirrorMetrics: How to evaluate Stable Diffusion LoRAs. A visual diagnostic tool to detect overfitting, check dataset quality, and fix tra…
☆60Feb 21, 2026Updated 5 months ago
RunanywhereAI / on-device-browser-agent
View on GitHub
On-device AI browser automation using WebLLM. No cloud, no API keys, fully private.
☆296Jan 22, 2026Updated 6 months ago
wildminder / ComfyUI-VibeVoice
View on GitHub
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
☆588Sep 25, 2025Updated 10 months ago
nari-labs / dia2
View on GitHub
TTS model capable of streaming conversational audio in realtime.
☆1,160Nov 29, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ncoder-ai / VibeVoice-FastAPI
View on GitHub
FastAPI wrapper around original Vibevoice 1.5B and 7B models, with support for AWQ4 quant
☆33Jun 22, 2026Updated last month
modl-org / modl
View on GitHub
Local-first AI image generation toolkit. Pull models, train LoRAs, generate images. One CLI, no glue code.
☆25Updated this week
IAMCCS / comfyui-iamccs-workflows
View on GitHub
Workflows built for my patreon page!
☆88Jun 25, 2026Updated last month
kyutai-labs / pocket-tts
View on GitHub
A TTS that fits in your CPU (and pocket)
☆7,929Jul 16, 2026Updated last week
jordandare / echo-tts
View on GitHub
Echo-TTS inference codebase
☆212Dec 5, 2025Updated 7 months ago
k2-fsa / OmniVoice
View on GitHub
High-Quality Voice Cloning TTS for 600+ Languages
☆8,611Updated this week
ScenemaAI / scenema-audio
View on GitHub
Zero-shot expressive voice cloning and speech generation. Generate anything from short clips to full-length audiobooks with realistic emo…
☆537Jul 7, 2026Updated 3 weeks ago