mco2004 / qwen-ttsLinks

Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generation on GitHub! 🚀🌟

☆108

Alternatives and similar repositories for qwen-tts

Users that are interested in qwen-tts are comparing it to the libraries listed below

Sorting:

HumanAIGC / chat-anyone
project page for ChatAnyone
☆115Updated 9 months ago
lovemefan / SenseVoice-python
SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime
☆108Updated 3 months ago
warmshao / ChatTTSPlus
Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment
☆174Updated 11 months ago
nethermanpro / transvip
☆167Updated last year
CyberWon / ChatTTS-API
ChatTTS HTTP API
☆54Updated last year
oneCodeSuperman / LstmSync
开源的LstmSync数字人泛化模型，只做最好的泛化模型！
☆135Updated this week
c4fun / tell-stories-webui
Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play
☆47Updated 9 months ago
zhongpei / Qwen-SDXL-Turbo
qwen create prompt for sdxl
☆34Updated 2 years ago
Ninot1Quyi / Qwen2.5-Omni-multimodal-chat
基于通义千问 Qwen2.5-Omni 的实时语音对话系统，使用在线API服务，支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …
☆83Updated 8 months ago
jianchang512 / sense-api
用于SenseVoice的api项目，输出带时间戳字幕
☆43Updated last year
Tencent-Hunyuan / HY-MT
☆282Updated last week
AIGeeksGroup / PresentAgent
[EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation
☆121Updated last month
ByteDance-Seed / Seed-X-7B
☆160Updated 4 months ago
dongdongzi / metahuman-stream
Real time streaming digital human based on nerf
☆18Updated last year
modelscope / flowra
☆71Updated last month
v3ucn / OpenVoiceV2_Webui_resemble_enhance
基于OpenVoice和Melotts整合的中文版webui，添加resemble_enhance音频增强功能
☆99Updated last year
xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆68Updated last week
Soul-AILab / SoulX-FlashTalk
SoulX-FlashTalk is the first 14B model to achieve a sub-second start-up latency (0.87s) while sustaining a real-time throughput of 32 FPS
☆72Updated this week
seetacloud / codewithgpu
codewithgpu.com python client package
☆20Updated 2 years ago
LB-Young / Bambo
Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…
☆33Updated 11 months ago
maitrix-org / Voila
☆483Updated 8 months ago
multimodal-art-projection / AutoMV
☆72Updated last week
Tencent / POINTS-Reader
☆191Updated last month
byteresearchcla / RealSI
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆76Updated 6 months ago
IronSpiderMan / MuseTalkPlus
基于MuseTalk的数字人代码。
☆34Updated last year
mush42 / optispeech
A lightweight end-to-end text-to-speech model
☆125Updated 10 months ago
MYZY-AI / Muyan-TTS
☆473Updated 7 months ago
Ma-Hongbo / StyleTailor
Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”
☆28Updated last month
FunAudioLLM / Fun-Audio-Chat
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
☆586Updated 2 weeks ago
chentuochao / Spatial-Speech-Translation
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆71Updated 4 months ago