0x5446/api4sensevoice

API and WebSocket server for SenseVoice. It integrates several enhanced features, such as voice activity detection (VAD), real-time streaming recognition, and speaker verification.

☆538

Alternatives and similar repositories for api4sensevoice

Users that are interested in api4sensevoice are comparing it to the libraries listed below.

pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆467 · Jun 15, 2026 · Updated last month
QwenAudio / SenseVoice
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,935 · Updated this week
HG-ha / SenseVoice-Api
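A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files fetched by URL.
☆112 · Sep 2, 2024 · Updated last year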
ABexit / ASR-LLM-TTS
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenseVoice…
☆1,262 · Jun 3, 2026 · Updated last month
lovemefan / SenseVoice.cpp
Port of FunASR's SenseVoice model in C/C++
☆568 · Dec 19, 2025 · Updated 7 months ago
RemSynch / SenseVoice-Real-Time
A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription.
☆42 · Sep 23, 2024 · Updated last year
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,467 · Updated this week
lovemefan / CT-Transformer-punctuation
An enterprise-grade Chinese-English code-switching punctuator from FunASR.
☆34 · Apr 26, 2024 · Updated 2 years ago
qi-hua / async_cosyvoice
Accelerates CosyVoice2 inference with vLLM.
☆498 · Apr 26, 2025 · Updated last year
v3ucn / ASR_TOOLS_SenseVoice_WebUI
Standalone WebUI for Bert-VITS2 transcription and annotation, integrating Alibaba's FunASR, Bijian (必剪) ASR, and the Whisper large model.
☆182 · Jul 10, 2024 · Updated 2 years ago
modelscope / 3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
☆3,070 · Dec 8, 2025 · Updated 7 months ago
78 / xiaozhi
Build your own AI friend
☆778 · Jun 7, 2025 · Updated last year
ruzhila / voiceapi
Streaming ASR and TTS based on FastAPI and sherpa-onnx
☆222 · Nov 2, 2025 · Updated 8 months ago
wwbin2017 / bailing
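Bailing (百聆) is a GPT-4o-style voice chat bot built from ASR, LLM, and TTS. It integrates large models such as DeepSeek R1, connects to openClaw, and works as a personal voice assistant with latency as low as 800 ms. It runs on low-spec machines such as Macs and supports interruption.
☆1,742 · Apr 6, 2026 · Updated 3 months ago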
lukeewin / AudioSeparationGUI
A GUI program for speaker separation built on FunASR.
☆163 · Dec 14, 2025 · Updated 7 months ago
QwenAudio / CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
☆22,400 · May 25, 2026 · Updated 2 months ago
lovemefan / SenseVoice-python
SenseVoice-python: An enterprise-grade, open-source, multilingual ASR system from FunASR, running on onnxruntime
☆114 · Jun 12, 2026 · Updated last month
ultrasev / stream-whisper
A pseudo-real-time speech transcription service based on faster-whisper.
☆241 · Apr 29, 2025 · Updated last year
lovemefan / fsmn-vad
An enterprise-grade voice activity detector from ModelScope and FunASR.
☆139 · Apr 26, 2023 · Updated 3 years ago
Ikaros-521 / RealtimeSTT_LLM_TTS
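Real-time STT connected to the OpenAI API or Zhipu AI (streaming LLM) and to GPT-SoVITS or Edge-TTS. Services are called across the network through a web page to achieve real-time conversation.
☆433 · Dec 31, 2024 · Updated last year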
jianchang512 / cosyvoice-api
An API project for CosyVoice.
☆333 · Aug 31, 2025 · Updated 10 months ago
k2-fsa / sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,783 · Updated this week
HaujetZhao / FunASR-Online-Paraformer-Test
☆52 · Nov 26, 2023 · Updated 2 years ago
xphh / fireredasr-streaming
Low-latency real-time ASR based on FireRedASR
☆62 · Jul 8, 2025 · Updated last year
DakeQQ / Voice-Activity-Detection-VAD-ONNX
Utilizes ONNX Runtime for speech activity detection.
☆46 · Jun 25, 2026 · Updated last month
lukeewin / FunASR_API
A speech recognition API with speaker diarization, implemented using FunASR.
☆27 · Jun 16, 2026 · Updated last month
Kedreamix / Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…
☆3,400 · Feb 10, 2026 · Updated 5 months ago
LuckLittleBoy / SenseVoice-OneApi
An API release based on the FunASR version of SenseVoice that integrates seamlessly with oneapi.
☆92 · Sep 5, 2024 · Updated last year
Henry-23 / VideoChat
Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3 s.
☆1,296 · Dec 18, 2025 · Updated 7 months ago
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15 · Dec 23, 2024 · Updated last year
FireRedTeam / FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,940 · Feb 25, 2026 · Updated 5 months ago
jundaychan / funasr-fastapi
A simple API version of FunASR speech-to-text, built with FunASR and FastAPI for easy deployment on a server.
☆13 · Aug 10, 2024 · Updated last year
zai-org / GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
☆3,209 · Dec 5, 2024 · Updated last year
lipku / LiveTalking
Real-time interactive streaming digital human
☆8,499 · Updated this week
wangzongming / esp-ai
The simplest and lowest-cost AI integration solution. If you like this project, please give it a star.
☆841 · Jan 9, 2026 · Updated 6 months ago
DakeQQ / Automatic-Speech-Recognition-ASR-ONNX
Utilizes ONNX Runtime to transcribe audio into text.
☆85 · Jul 10, 2026 · Updated 2 weeks ago
harry0703 / AudioNotes
Quickly extracts audio and video content and organizes it into structured Markdown notes.
☆2,222 · Updated this week
xinnan-tech / xiaozhi-esp32-server
Backend service for xiaozhi-esp32 that helps you quickly build an ESP32 device control server.
☆10,133 · Updated this week
TOM88812 / xiaozhi-web-client
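If you want to try the Xiaozhi project, or you are developing and testing the server side, you can use this web demo. The voice and text sides are both complete, with combined voice and text output. It will be refined over further iterations. PRs welcome.
☆183 · Jun 7, 2025 · Updated last year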