QwenLM/Qwen3-TTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QwenLM/Qwen3-TTS)

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.

☆12,672

Alternatives and similar repositories for Qwen3-TTS

Users that are interested in Qwen3-TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

QwenLM / Qwen3-ASR
View on GitHub
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…
☆3,246Jun 26, 2026Updated last month
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464May 25, 2026Updated 2 months ago
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,753Jul 21, 2026Updated last week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,559Updated this week
index-tts / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,251Jul 14, 2026Updated 2 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆51,268Updated this week
flybirdxx / ComfyUI-Qwen-TTS
View on GitHub
A Simple Implementation of Qwen3-TTS's ComfyUI
☆1,811Jun 3, 2026Updated last month
k2-fsa / OmniVoice
View on GitHub
High-Quality Voice Cloning TTS for 600+ Languages
☆8,611Updated this week
OpenBMB / VoxCPM
View on GitHub
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
☆34,496Jul 8, 2026Updated 3 weeks ago
OpenMOSS / MOSS-TTS
View on GitHub
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi…
☆3,923Updated this week
zai-org / GLM-TTS
View on GitHub
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046Apr 10, 2026Updated 3 months ago
andimarafioti / faster-qwen3-tts
View on GitHub
Real-time text-to-speech with Qwen3-TTS
☆1,261Jul 17, 2026Updated last week
QwenLM / Qwen3-Omni
View on GitHub
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…
☆3,917Apr 23, 2026Updated 3 months ago
hexgrad / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆8,168Aug 6, 2025Updated 11 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jamiepine / voicebox
View on GitHub
The open-source AI voice studio. Clone, dictate, create.
☆47,343Updated this week
ace-step / ACE-Step-1.5
View on GitHub
The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA …
☆11,890Updated this week
NVIDIA / personaplex
View on GitHub
PersonaPlex code.
☆10,273Mar 2, 2026Updated 4 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,023Dec 2, 2025Updated 7 months ago
studio-dots-ai / dots.tts
View on GitHub
☆997Jul 10, 2026Updated 2 weeks ago
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆15,042Jul 23, 2026Updated last week
kyutai-labs / pocket-tts
View on GitHub
A TTS that fits in your CPU (and pocket)
☆7,929Jul 16, 2026Updated 2 weeks ago
Comfy-Org / ComfyUI
View on GitHub
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☆122,731Updated this week
neuphonic / neutts
View on GitHub
On-device TTS model by Neuphonic
☆6,207Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
KittenML / KittenTTS
View on GitHub
State-of-the-art TTS model under 25MB 😻
☆15,248Jun 11, 2026Updated last month
stepfun-ai / Step-Audio-EditX
View on GitHub
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆956Apr 9, 2026Updated 3 months ago
boson-ai / higgs-audio
View on GitHub
Text-audio foundation model from Boson AI
☆8,304Jun 5, 2026Updated last month
Lightricks / LTX-2
View on GitHub
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
☆8,450Jul 8, 2026Updated 3 weeks ago
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,542Updated this week
ysharma3501 / LuxTTS
View on GitHub
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
☆4,869Jun 5, 2026Updated last month
QwenAudio / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,954Updated this week
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆69,060Updated this week
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,264Dec 5, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆106,032Updated this week
supertone-inc / supertonic
View on GitHub
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,547Updated this week
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,609Nov 19, 2025Updated 8 months ago
Tongyi-MAI / Z-Image
View on GitHub
☆11,797Feb 9, 2026Updated 5 months ago
openclaw / openclaw
View on GitHub
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
☆384,457Updated this week
meituan-longcat / LongCat-AudioDiT
View on GitHub
☆555Apr 3, 2026Updated 3 months ago
QwenAudio / Fun-Audio-Chat
View on GitHub
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
☆985Feb 27, 2026Updated 5 months ago