jianchang512/cosyvoice-api

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jianchang512/cosyvoice-api)

jianchang512 / cosyvoice-api

一个用于CosyVoice的api接口项目

☆333

Alternatives and similar repositories for cosyvoice-api

Users that are interested in cosyvoice-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

diudiu62 / CosyVoice-api
View on GitHub
☆33Feb 28, 2025Updated last year
journey-ad / CosyVoice2-Ex
View on GitHub
CosyVoice2 功能扩充（预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API）
☆195Mar 13, 2025Updated last year
qi-hua / async_cosyvoice
View on GitHub
使用vllm加速cosyvoice2的推理
☆498Apr 26, 2025Updated last year
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,400May 25, 2026Updated 2 months ago
LuckLittleBoy / SenseVoice-OneApi
View on GitHub
基于SenseVoice的funasr版本进行的api发布，可以无缝对接oneapi
☆92Sep 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HG-ha / SenseVoice-Api
View on GitHub
阿里SenseVoice的fastpi封装，采用onnx发布，体积更小，附带量化模型，支持GPU。支持从URL文件进行语音识别。
☆112Sep 2, 2024Updated last year
kodeleung / CosyVoice2
View on GitHub
基于官方提供的CosyVoice改造，整体交互适配CosyVoice2模型，开箱即用
☆23Jun 15, 2025Updated last year
v3ucn / CosyVoice_For_Windows
View on GitHub
CosyVoice在Windows环境下使用的版本
☆766Nov 19, 2024Updated last year
jianchang512 / sense-api
View on GitHub
用于SenseVoice的api项目，输出带时间戳字幕
☆49Oct 28, 2024Updated last year
catcto / CosyVoiceDocker
View on GitHub
This repository provides a Docker image for CosyVoice
☆27Dec 22, 2024Updated last year
easygoingbl / auditlimit
View on GitHub
内容审核及速率限制服务
☆26May 18, 2025Updated last year
302ai / 302_vector_graphics_generation
View on GitHub
🖼️🤖 302 Vector Graphics Generation! 🚀✨
☆17Aug 26, 2025Updated 11 months ago
BiboyQG / bob-cosyvoice
View on GitHub
A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.
☆39Aug 13, 2024Updated last year
wwbin2017 / bailing
View on GitHub
百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，集成DeepSeek R1等优秀大模型，接入openClaw，真正的个人语音助手，时延低至800ms，Mac等低配置也可运行，支持打断
☆1,742Apr 6, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ABexit / ASR-LLM-TTS
View on GitHub
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…
☆1,262Jun 3, 2026Updated last month
QwenAudio / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,935Updated this week
shenduldh / CosyVoice-Lightning
View on GitHub
Lightning-responsive CosyVoice streaming API based on FastAPI.
☆28Apr 27, 2026Updated 2 months ago
ZV-Liu / Step-Audio
View on GitHub
Step-Audio-TTS-3B demo
☆13Feb 25, 2025Updated last year
Ksuriuri / index-tts-vllm
View on GitHub
Added vLLM support to IndexTTS for faster inference.
☆1,209Apr 13, 2026Updated 3 months ago
lenML / Speech-AI-Forge
View on GitHub
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
☆1,415May 21, 2026Updated 2 months ago
zai-org / GLM-4-Voice
View on GitHub
GLM-4-Voice | 端到端中英语音对话模型
☆3,209Dec 5, 2024Updated last year
Henry-23 / VideoChat
View on GitHub
实时交互数字人，可自定义形象与音色，支持音色克隆，对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning,…
☆1,296Dec 18, 2025Updated 7 months ago
lukeewin / ASR_LLM_TTS_Front
View on GitHub
ASR_LLM_TTS前端项目
☆15Dec 3, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
pengzhendong / streaming-sensevoice
View on GitHub
Pseudo Streaming SenseVoice with Hotwords
☆467Jun 15, 2026Updated last month
0x5446 / api4sensevoice
View on GitHub
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆538Oct 23, 2024Updated last year
chengyuanba / avatar_ernerf
View on GitHub
Just a suturing monster project.
☆38Nov 21, 2023Updated 2 years ago
HonestQiao / xiaozhi-py
View on GitHub
小智同学测试工具(websocket)
☆45Feb 20, 2025Updated last year
zaigie / FunSpeech
View on GitHub
开箱即用的本地私有化部署语音服务，快速搭建Qwen3ASR/FunASR与Qwen3TTS/CosyVoice后端
☆153Jul 6, 2026Updated 2 weeks ago
makerjackie / ChatTTS-api-ui-docker
View on GitHub
One command to run ChatTTS
☆60Jun 6, 2024Updated 2 years ago
lipku / LiveTalking
View on GitHub
Real time interactive streaming digital human
☆8,499Jul 19, 2026Updated last week
jianchang512 / f5-tts-api
View on GitHub
一个用于F5-TTS的api和webui项目
☆63Dec 25, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,467Updated this week
ZygoteCode / VadSharp
View on GitHub
Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, …
☆10Apr 20, 2025Updated last year
jianchang512 / ChatTTS-ui
View on GitHub
一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…
☆7,626Jun 14, 2026Updated last month
anliyuan / Ultralight-Digital-Human
View on GitHub
一个超轻量级、可以在移动端实时运行的数字人模型
☆2,590Updated this week
lewangdev / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆13Jul 15, 2024Updated 2 years ago
jianchang512 / gptsovits-api
View on GitHub
适用于 GPT-SoVITS 的api调用接口
☆344Mar 7, 2024Updated 2 years ago
PeterH0323 / Streamer-Sales
View on GitHub
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文…
☆3,743Mar 8, 2025Updated last year