bbeyondllove / asr_server
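A high-performance speech recognition service based on Sherpa-ONNX, supporting real-time VAD (voice activity detection), multilingual speech recognition, and speaker (voiceprint) recognition.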
☆60 Updated 5 months ago
Alternatives and similar repositories for asr_server
Users who are interested in asr_server are comparing it to the repositories listed below
- ☆346 Updated 3 weeks ago
- A complete local deployment of an ASR (K2) - NLP (Rasa, Spacy) - LLM (Chatglm2) - TTS (Vits) pipeline ☆150 Updated 8 months ago
- A Golang backend service for Xiaozhi, supporting WebSocket and MQTT+UDP ☆111 Updated last week
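- A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with quantized models included and GPU support. Supports speech recognition from files fetched by URL. ☆104 Updated last year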
- Pseudo Streaming SenseVoice with Hotwords ☆409 Updated 9 months ago
- Xiaozhi WebSocket protocol implemented in Golang; set up your own xiaozhi-server by routing requests to OpenAI Realtime API protocol such… ☆39 Updated 7 months ago
- An MQTT+UDP server for Xiaozhi AI, with dynamic load balancing across backend WebSocket programs ☆95 Updated 7 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,… ☆532 Updated last year
- Sample Repository for the AlibabaCloud Bailian Speech SDK ☆335 Updated last week
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆177 Updated last month
- An API interface project for CosyVoice ☆327 Updated 3 months ago
- Port of Funasr's Sense-voice model in C/C++ ☆500 Updated last week
- An API release based on the funasr version of SenseVoice, which integrates seamlessly with oneapi ☆89 Updated last year
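- For anyone who wants to try the Xiaozhi project or test server-side development, this web demo is available. The voice side and the text side are both complete, so it can output voice plus text. It will be refined over further iterations. PRs welcome. ☆173 Updated 6 months ago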
- Accelerates cosyvoice2 inference using vllm ☆465 Updated 8 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment ☆174 Updated 10 months ago
- An asynchronous voice dialogue component. ☆31 Updated 9 months ago
- STT WebSocket server using sherpa-onnx ☆41 Updated 4 months ago
- Provides high-quality Chinese speech synthesis and voice cloning services based on models such as SparkTTS and OrpheusTTS. ☆565 Updated 7 months ago
- Added vLLM support to IndexTTS for faster inference. ☆963 Updated 2 months ago
- A speaker (voiceprint) recognition API service based on 3D-Speaker, used to identify speakers on Xiaozhi devices. ☆92 Updated 5 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab. ☆503 Updated this week
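- Real-time STT connected to the OpenAI API / Zhipu AI (streaming LLM) and GPT-SOVITS / Edge-TTS, with services called across networks through a web page to achieve real-time conversation ☆429 Updated 11 months ago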
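- A small project that combines VAD, a voiceprint lock, and SenseVoice in a simple implementation of near-real-time speech transcription ☆42 Updated last year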
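- CosyVoice2 feature extensions (pretrained voice inference / 3-second rapid cloning / natural language control / automatic recognition / voice model saving / API) ☆182 Updated 9 months ago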
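- The Bert-VITS2 project has many bugs and unfriendly tutorials. This project fixes as many Bert-VITS2 bugs as possible and supports one-click training. Only 50 utterances from the target speaker are needed to obtain a stable, fast TTS model. ☆65 Updated 4 months ago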
- A modular, low-resource conversational robot / smart speaker that can run fully offline ☆124 Updated last week
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University. ☆684 Updated last month
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting… ☆1,060 Updated 2 weeks ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR). ☆93 Updated last month