bbeyondllove / asr_server
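A high-performance speech recognition service based on Sherpa-ONNX, supporting real-time VAD (voice activity detection), multilingual speech recognition, and voiceprint (speaker) recognition.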
☆76Updated last month
Alternatives and similar repositories for asr_server
Users that are interested in asr_server are comparing it to the libraries listed below
- Golang version of the Xiaozhi backend service, supporting WebSocket and MQTT+UDP☆153Updated this week
- Fully local deployment of an ASR (K2) - NLP (Rasa, Spacy) - LLM (Chatglm2) - TTS (Vits) pipeline☆150Updated 10 months ago
- ☆383Updated last week
- Pseudo Streaming SenseVoice with Hotwords☆426Updated 10 months ago
- MQTT+UDP server for Xiaozhi AI, supporting dynamic load balancing of backend WebSocket programs☆102Updated 8 months ago
- ☆13Updated 2 years ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆540Updated last year
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆188Updated 3 months ago
- Xiaozhi WebSocket protocol implemented in Golang; set up your own xiaozhi-server by routing requests to OpenAI Realtime API protocol such…☆41Updated 8 months ago
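- A modular, low-resource conversational robot / smart speaker that can run fully offline☆131Updated 3 weeks ago
- FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files at a URL.☆104Updated last year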
- Port of Funasr's Sense-voice model in C/C++☆514Updated last month
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆367Updated last month
- STT WebSocket server using sherpa-onnx☆46Updated 6 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated last year
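- Bailing (百聆) is a GPT-4o-like voice conversation bot built with ASR+LLM+TTS, integrating strong large models such as DeepSeek R1, with latency as low as 800 ms; it runs even on low-spec machines such as Macs and supports interruption☆1,604Updated 6 months ago
- Accelerating cosyvoice2 inference with vllm☆481Updated 9 months ago
- An API interface project for CosyVoice☆335Updated 5 months ago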
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆873Updated last week
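- If you want to try the Xiaozhi project, or are developing and testing the server side, you can try this web demo. The voice side and the text side are both complete, and it can output voice plus text. It will be refined over further iterations. PRs welcome.☆177Updated 8 months ago
- A simplified deployment scheme for the Fun-CosyVoice3-0.5B-2512 speech synthesis service, with quick testing and deployment for use by applications☆59Updated last month
- High-quality Chinese speech synthesis and voice cloning service based on models such as SparkTTS and OrpheusTTS.☆586Updated 8 months ago
- API release based on the funasr version of SenseVoice, integrating seamlessly with oneapi☆92Updated last year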
- ☆18Updated 10 months ago
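- Speech recognition implemented with FunASR, including a standard version and an ONNX version (recommended).☆48Updated last year
- A speech recognition API with speaker diarization, implemented using FunASR.☆17Updated 2 weeks ago
- The Bert-VITS2 project has many bugs and unfriendly tutorials. This project fixes as many Bert-VITS2 bugs as possible and supports one-click training. Only 50 utterances from the target speaker are needed to obtain a stable, fast TTS model.☆67Updated 5 months ago
- A small project that combines VAD, a voiceprint lock, and SenseVoice in a simple implementation of near-real-time speech transcription☆42Updated last year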
- Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Spee…☆104Updated this week
- RTC AIGC Demo☆246Updated 2 months ago