lukeewin / ASR_LLM_TTS
This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.
☆24 Updated last year
Alternatives and similar repositories for ASR_LLM_TTS
Users who are interested in ASR_LLM_TTS are comparing it to the repositories listed below.
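- Front-end project for ASR_LLM_TTS ☆15 Updated last year
- A simple API for CosyVoice speech synthesis ☆14 Updated last year
- A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files at a URL. ☆104 Updated last year
- An asynchronous voice dialogue component. ☆32 Updated 9 months ago
- An API project for CosyVoice ☆331 Updated 4 months ago
- CosyVoice2 feature extensions (pretrained voice inference / 3s rapid cloning / natural-language control / automatic recognition / voice model saving / API) ☆184 Updated 9 months ago
- An API release based on the FunASR version of SenseVoice; integrates seamlessly with oneapi ☆89 Updated last year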
- ☆33 Updated 10 months ago
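- Real-time STT connected to the OpenAI API / Zhipu AI (streaming LLM) and GPT-SOVITS / Edge-TTS, calling services across networks through a web page to achieve real-time conversation ☆430 Updated last year
- A GUI program for speaker diarization built on FunASR ☆154 Updated 3 weeks ago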
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,… ☆538 Updated last year
- A Chinese speech-to-text project wrapping FireRedASR ☆81 Updated 10 months ago
- Pseudo Streaming SenseVoice with Hotwords ☆412 Updated 9 months ago
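- MQTT+UDP server for Xiaozhi AI, with dynamic load balancing across backend WebSocket programs ☆97 Updated 7 months ago
- Accelerates CosyVoice2 inference with vLLM ☆465 Updated 8 months ago
- A WebSocket server modified from the official FunASR demo, paired with FastAPI to provide an HTTP service; allows real-time ASR testing in the browser ☆45 Updated 5 months ago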
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆180 Updated 2 months ago
- ☆375 Updated last year
- A pseudo-real-time speech transcription service based on faster-whisper ☆233 Updated 8 months ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK ☆348 Updated 3 weeks ago
- WebUI built on SambertHifigan-TTS ☆11 Updated 2 years ago
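- Speech recognition API offering real-time recognition and offline recognition of uploaded long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries ☆81 Updated last year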
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment ☆174 Updated 11 months ago
- Speech recognition based on FunASR, with a standard version and an ONNX version (recommended). ☆47 Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR). ☆92 Updated last month
- Test tool for Xiaozhi (WebSocket) ☆47 Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆59 Updated last year
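- A voiceprint recognition API service based on 3D-Speaker, used to identify speakers on Xiaozhi devices. ☆96 Updated 5 months ago
- A modular, fully offline-capable, low-resource conversational robot / smart speaker ☆124 Updated last week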
- Added vLLM support to IndexTTS for faster inference. ☆998 Updated 2 months ago