sdbds/Zonos-for-windows

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sdbds/Zonos-for-windows)

sdbds / Zonos-for-windows

☆501

Alternatives and similar repositories for Zonos-for-windows

Users that are interested in Zonos-for-windows are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Zyphra / Zonos
View on GitHub
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆7,229Mar 5, 2025Updated last year
BuffMcBigHuge / ComfyUI-Zonos
View on GitHub
ComfyUI node to make text to speech audio with your own voices.
☆72Apr 29, 2025Updated last year
HiDream-ai / HiDream-E1
View on GitHub
☆789Jul 17, 2025Updated last year
erew123 / alltalk_tts
View on GitHub
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆2,417Jan 9, 2026Updated 6 months ago
ZeyueT / AudioX
View on GitHub
[ICLR 2026] Repository of AudioX
☆1,543Mar 10, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
deepbeepmeep / YuEGP
View on GitHub
YuE: Open Full-song Generation Foundation for the GPU Poor
☆481Feb 14, 2025Updated last year
Enemyx-net / VibeVoice-ComfyUI
View on GitHub
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice …
☆1,519Feb 18, 2026Updated 5 months ago
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,990Jul 5, 2026Updated 2 weeks ago
kijai / ComfyUI-HunyuanVideoWrapper
View on GitHub
☆2,596Aug 20, 2025Updated 11 months ago
nxnai / Voost
View on GitHub
[SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
☆340May 15, 2026Updated 2 months ago
deepbeepmeep / Wan2GP
View on GitHub
A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, LTX-2, Qwen Image, Hunyuan Video, LTX Video and Flux.
☆6,689Updated this week
multimodal-art-projection / YuE
View on GitHub
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
☆6,338Jun 4, 2025Updated last year
NeuralFalconYT / Kokoro-TTS-Subtitle
View on GitHub
☆45Sep 21, 2025Updated 10 months ago
Mangio621 / Mangio-RVC-Fork
View on GitHub
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other …
☆1,228Sep 27, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
daswer123 / xtts-webui
View on GitHub
Webui for using XTTS and for finetuning it
☆890Jan 17, 2025Updated last year
natlamir / LipStick
View on GitHub
A virtual makeup that will make faces look radiant! Get rid of that ugly face mask box on your videos: Get your magic Lipstick now!
☆11Sep 16, 2024Updated last year
lum3on / comfyui_HiDream-Sampler
View on GitHub
ComfyUI Wrapper for HiDream
☆479Apr 22, 2025Updated last year
hyz317 / StdGEN
View on GitHub
[CVPR 2025] StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
☆388Apr 17, 2026Updated 3 months ago
BahaC / ComfyUI-ZonosTTS
View on GitHub
ComfyUI Implementation of Zonos Text to Speech Model
☆26Feb 19, 2025Updated last year
Lightricks / ComfyUI-LTXVideo
View on GitHub
LTX-Video Support for ComfyUI
☆3,979Jun 30, 2026Updated 3 weeks ago
sdbds / hallo-for-windows
View on GitHub
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
☆207Jun 25, 2024Updated 2 years ago
Glat0s / VideoVoiceSwap
View on GitHub
Zeroshot Video voice swapper, speech2rir & blindRT60, Speaker & Scene change detection
☆20May 31, 2025Updated last year
Kr1sJFU / iMontage
View on GitHub
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
☆188Dec 1, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
axel-devs / VisoMaster-Job-Manager
View on GitHub
A fully fledged job management system mod for VisoMaster with numerous other improvements
☆13May 31, 2025Updated last year
JarodMica / audiobook_maker
View on GitHub
☆582Feb 21, 2026Updated 5 months ago
FranckyB / ComfyUI-DramaBox
View on GitHub
Port of resemble-ai's DramaBox for ComfyUI
☆43May 20, 2026Updated 2 months ago
woct0rdho / triton-windows
View on GitHub
Fork of the Triton language and compiler for Windows support and easy installation
☆1,955Feb 18, 2026Updated 5 months ago
NUS-HPC-AI-Lab / Enhance-A-Video
View on GitHub
Enhance-A-Video: Better Generated Video for Free
☆598Mar 17, 2025Updated last year
nivibilla / local-llasa-tts
View on GitHub
Examples of using the llasa-tts models locally
☆178Apr 20, 2025Updated last year
Fictiverse / bark
View on GitHub
🔊 Text-prompted Generative Audio Model
☆236Apr 27, 2023Updated 3 years ago
HiDream-ai / HiDream-I1
View on GitHub
☆2,510Jul 16, 2025Updated last year
hykilpikonna / HiDream-I1-nf4
View on GitHub
4Bit Quantized Model for HiDream I1
☆246May 19, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wildminder / ComfyUI-KaniTTS
View on GitHub
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
☆38Oct 17, 2025Updated 9 months ago
Fantasy-AMAP / fantasy-portrait
View on GitHub
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
☆511Aug 20, 2025Updated 11 months ago
zackabrams / ComfyUI-MagicWan
View on GitHub
Implementing FlowEdit, maybe other inversion techniques for the Wan video generation model
☆54Feb 28, 2025Updated last year
Fantasy-AMAP / fantasy-talking
View on GitHub
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
☆1,622Jan 26, 2026Updated 5 months ago
logtd / ComfyUI-HunyuanLoom
View on GitHub
A set of nodes to edit videos using the Hunyuan Video model
☆491Feb 21, 2025Updated last year
PangzeCheung / OmniTransfer
View on GitHub
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer
☆233Apr 15, 2026Updated 3 months ago
ASLP-lab / DiffRhythm
View on GitHub
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
☆2,322Nov 27, 2025Updated 7 months ago