bbeyondllove / asr_serverLinks
一个基于 Sherpa-ONNX 的高性能语音识别服务,支持实时VAD(语音活动检测)、多语言语音识别和声纹识别功能。
☆50Updated 4 months ago
Alternatives and similar repositories for asr_server
Users that are interested in asr_server are comparing it to the libraries listed below
Sorting:
- golang版本的小智后端服务,支持websocket和mqtt+udp☆109Updated 2 weeks ago
- 本地完整部署ASR(K2)-NLP(Rasa,Spacy)-LLM(Chatglm2)-TTS(Vits)☆150Updated 8 months ago
- ☆339Updated this week
- Pseudo Streaming SenseVoice with Hotwords☆398Updated 8 months ago
- 一个用于CosyVoice的api接口项目☆323Updated 3 months ago
- 小智AI的MQTT+UDP服务器,支持后端Websocket程序动态负载均衡☆90Updated 6 months ago
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- Added vLLM support to IndexTTS for faster inference.☆911Updated last month
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆529Updated last year
- Xiaozhi websocket protocol implemented by Golang, setup your own xiaozhi-server by routing requests to OpenAI Realtime API protocol such…☆39Updated 6 months ago
- Port of Funasr's Sense-voice model in C/C++☆484Updated 2 months ago
- 基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。☆557Updated 6 months ago
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆1,047Updated last week
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆177Updated 8 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆173Updated 9 months ago
- 如果想体验小智项目,或者开发server端测试的同志,可以使用这个web端damo 体验下。 语音端已经完成,文字端完成,可以语音加文字输出。 等迭代慢慢完善。欢迎PR☆166Updated 6 months ago
- 使用vllm加速cosyvoice2的推理☆458Updated 7 months ago
- 异步语音对话组件。☆30Updated 8 months ago
- ☆13Updated 2 years ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆319Updated last month
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆46Updated last year
- 百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断☆1,534Updated 4 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆667Updated last week
- 智谱 glm realtime api python/golang/ts sdk, 包括 low level 的 websocket client 封装以及各个场景的调用样例☆21Updated 6 months ago
- 这是一款基于FunASR实现的说话人分离的GUI程序☆141Updated 4 months ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆65Updated 3 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆169Updated last month
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆41Updated last year
- ☆31Updated 9 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆88Updated last year