mco2004 / qwen-ttsLinks
Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generation on GitHub! 🚀🌟
☆108Updated this week
Alternatives and similar repositories for qwen-tts
Users that are interested in qwen-tts are comparing it to the libraries listed below
Sorting:
- project page for ChatAnyone☆115Updated 9 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 3 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆174Updated 11 months ago
- ☆167Updated last year
- ChatTTS HTTP API☆54Updated last year
- 开源的LstmSync数字人泛化模型,只做最好的泛化模型!☆135Updated this week
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆47Updated 9 months ago
- qwen create prompt for sdxl☆34Updated 2 years ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆83Updated 8 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆43Updated last year
- ☆282Updated last week
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆121Updated last month
- ☆160Updated 4 months ago
- Real time streaming digital human based on nerf☆18Updated last year
- ☆71Updated last month
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆99Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆68Updated last week
- SoulX-FlashTalk is the first 14B model to achieve a sub-second start-up latency (0.87s) while sustaining a real-time throughput of 32 FPS☆72Updated this week
- codewithgpu.com python client package☆20Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Updated 11 months ago
- ☆483Updated 8 months ago
- ☆72Updated last week
- ☆191Updated last month
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆76Updated 6 months ago
- 基于MuseTalk的数字人代码。☆34Updated last year
- A lightweight end-to-end text-to-speech model☆125Updated 10 months ago
- ☆473Updated 7 months ago
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆28Updated last month
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆586Updated 2 weeks ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆71Updated 4 months ago