mco2004 / qwen-tts
Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generation on GitHub! 🚀🌟
☆81Updated this week
Alternatives and similar repositories for qwen-tts
Users who are interested in qwen-tts are comparing it to the repositories listed below
- SenseVoice-python: An enterprise-grade, open-source, multilingual ASR system from FunASR, using onnxruntime☆106Updated last month
- project page for ChatAnyone☆115Updated 7 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆42Updated 7 months ago
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆22Updated 2 months ago
- ☆166Updated 11 months ago
- codewithgpu.com python client package☆19Updated 2 years ago
- Real-time streaming digital human based on NeRF☆17Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 9 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆107Updated last month
- Video understanding: Qwen video multimodal model & Dify☆65Updated last year
- Use Qwen to create prompts for SDXL☆34Updated last year
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- An API project for SenseVoice that outputs timestamped subtitles☆42Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 9 months ago
- A real-time voice conversation system based on Qwen2.5-Omni (Tongyi Qianwen) that uses the online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing. …☆78Updated 5 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- ChatTTS HTTP API☆54Updated last year
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework.☆29Updated last year
- Simple and fast wav2lip using ONNX models for face detection and inference. Easy installation☆25Updated last year
- ☆184Updated last month
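- One-click deployment of a self-hosted WeChat mini program and web operations admin system for queued automatic digital human training and queued short-video generation. Audio-driven lip sync based on single-person training, better than lip-sync solutions such as wav2lip, deepfacelab, liveportrait, and musetalk; ready for commercial use, with voice cloning for Chinese, Japanese, English, and Korean☆52Updated 6 months ago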
- A Chinese-language web UI integrating OpenVoice and Melotts, with resemble_enhance audio enhancement added☆98Updated last year
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29Updated last year
- Incredibly descriptive audiovisual summaries for videos☆40Updated last year
- livekit agent plugins☆22Updated 3 weeks ago
- Python3 package for Chinese/English OCR, using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference based on the ppocr-v5-onnx model, open-source Chinese/English OCR…☆111Updated 3 months ago
- To try textin document parsing, please visit https://cc.co/16YSIy☆22Updated last year
- Fetch bilibili live-stream danmaku (bullet comments) using the WebSocket protocol☆37Updated last year
- High-resolution clothing swap, virtual try-on, and object replacement for people in 4K video☆56Updated 3 years ago
- xllamacpp - a Python wrapper of llama.cpp☆62Updated last week