lukeewin / ASR_LLM_TTS
This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.
☆11Updated last month
Alternatives and similar repositories for ASR_LLM_TTS:
Users who are interested in ASR_LLM_TTS are comparing it to the repositories listed below
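- A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files at a URL.☆51Updated 4 months ago
- A GPT-SoVITS inference demo that automatically switches the reference audio based on sentiment analysis of the Chinese text☆89Updated 10 months ago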
- ☆44Updated last year
- SenseVoice-python: An enterprise-grade, open-source, multi-language ASR system from FunASR, with onnxruntime☆80Updated 4 months ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆67Updated 4 months ago
- Pseudo Streaming SenseVoice with Hotwords☆171Updated last month
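- The Bert-VITS2 project has many bugs and unfriendly tutorials. This project fixes as many Bert-VITS2 bugs as possible and supports one-click training. Only 50 utterances from the target speaker are needed to obtain a stable, fast TTS model.☆44Updated 4 months ago
- An API project for CosyVoice☆155Updated last week
- An API release based on the FunASR version of SenseVoice that integrates seamlessly with oneapi☆35Updated 4 months ago
- Extension of ChatTTS, 3x Faster on Windows, Supports Voice Cloning and Mobile Deployment☆105Updated 3 weeks ago
- ONNX inference version of Bert-VITS2☆40Updated 9 months ago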
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆308Updated 3 months ago
- ChatTTS HTTP API☆50Updated 7 months ago
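- An asynchronous voice dialogue component.☆12Updated last month
- This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenseVoice…☆308Updated 2 weeks ago
- [Skip the complex environment setup and bundled packages; an inference service with minimal configuration] A pure inference service solution extracted from the GPT-SoVITS project.☆231Updated 9 months ago
- A standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba's FunAsr, Bcut (必剪) Asr, and the Whisper large model☆161Updated 6 months ago
- Notes on deploying GLM-4-Voice on Ubuntu☆19Updated 2 months ago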
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆67Updated this week
- SummerAsr is a local Chinese speech recognizer written in C++ that compiles standalone with almost no additional dependencies. Summer Asr is a Chinese automatic speech recognition project written in C++ that can be eas…☆86Updated last month
- Real time faster whisper gradio☆26Updated 3 months ago
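- Real-time STT connected to the OpenAI API / Zhipu AI (streaming LLM) and GPT-SOVITS / Edge-TTS, calling services across networks through a web page to achieve real-time conversation☆306Updated 3 weeks ago
- A simple audio noise-reduction tool that provides a web UI and an API☆19Updated 2 months ago
- Speech recognition built on FunASR, with a standard version and an ONNX version (recommended).☆28Updated 3 months ago
- Bailing (百聆) is a GPT-4o-like voice dialogue bot built with ASR + LLM + TTS, with latency as low as 800 ms; it runs on low-end hardware and supports interruption☆432Updated last week
- Fine-tuning the Qwen2 large model with the Unsloth framework on native Windows (not WSL)☆46Updated 7 months ago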
- A streaming speech recognition project focused on Faster Whisper.☆15Updated 4 months ago
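- Speech recognition API with real-time recognition and offline recognition of uploaded long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and other languages from up to 100 countries☆65Updated last month
- Luoxi (洛曦) digital-human video player with an HTTP API; uses the gradio API to connect to Easy-Wav2Lip, Sadtalker, GeneFacePlusPlus, and MuseTalk, and can also play local videos☆156Updated 3 months ago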
- An implementation of MeloTTS by onnxruntime☆18Updated 3 months ago