一个用于CosyVoice的api接口项目
☆336Aug 31, 2025Updated 6 months ago
Alternatives and similar repositories for cosyvoice-api
Users that are interested in cosyvoice-api are comparing it to the libraries listed below
Sorting:
- ☆33Feb 28, 2025Updated last year
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆189Mar 13, 2025Updated 11 months ago
- 使用vllm加速cosyvoice2的推理☆484Apr 26, 2025Updated 10 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,786Feb 11, 2026Updated 3 weeks ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆42Oct 28, 2024Updated last year
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆92Sep 5, 2024Updated last year
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆106Sep 2, 2024Updated last year
- CosyVoice在Windows环境下使用的版本☆755Nov 19, 2024Updated last year
- This repository provides a Docker image for CosyVoice☆27Dec 22, 2024Updated last year
- 基于官方提供的CosyVoice改造,整体交互适配CosyVoice2模型,开箱即用☆22Jun 15, 2025Updated 8 months ago
- Multilingual Voice Understanding Model☆7,611Dec 30, 2025Updated 2 months ago
- Added vLLM support to IndexTTS for faster inference.☆1,075Updated this week
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…☆1,134Mar 1, 2025Updated last year
- 百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断☆1,616Jul 31, 2025Updated 7 months ago
- 一个用于F5-TTS的api和webui项目☆64Dec 25, 2024Updated last year
- 内容审核及速率限 制服务☆26May 18, 2025Updated 9 months ago
- Pseudo Streaming SenseVoice with Hotwords☆429Mar 13, 2025Updated 11 months ago
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆39Aug 13, 2024Updated last year
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆1,386Feb 3, 2026Updated last month
- GLM-4-Voice | 端到端中英语音对话模型☆3,144Dec 5, 2024Updated last year
- 实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning,…☆1,211Dec 18, 2025Updated 2 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆538Oct 23, 2024Updated last year
- 用于kokoro TTS的webui界面和兼容openai api☆39Feb 4, 2025Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Aug 8, 2025Updated 6 months ago
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- Step-Audio-TTS-3B demo☆13Feb 25, 2025Updated last year
- 一个超轻量级、可以在移动端实时运行的数字人模型☆2,427Sep 18, 2025Updated 5 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,036Updated this week
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- Real time interactive streaming digital human☆7,176Feb 24, 2026Updated last week
- Python的音频工具☆16Dec 5, 2025Updated 3 months ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
- ☆16Apr 11, 2024Updated last year
- Just a suturing monster project.☆42Nov 21, 2023Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago