ABexit / ASR-LLM-TTSLinks

This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three TTS models: CosyVoice, Edge-TTS, and pyttsx3

☆893

Alternatives and similar repositories for ASR-LLM-TTS

Users that are interested in ASR-LLM-TTS are comparing it to the libraries listed below

Sorting:

0x5446 / api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆481Updated 9 months ago
wwbin2017 / bailing
百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，集成DeepSeek R1等优秀大模型，时延低至800ms，Mac等低配置也可运行，支持打断
☆1,370Updated last week
Ikaros-521 / RealtimeSTT_LLM_TTS
实时STT，连接OpenAI接口/智谱AI（流式LLM）和GPT-SOVITS/Edge-TTS，通过网页的方式，进行跨网络的服务调用，实现实时对话的效果
☆410Updated 7 months ago
TommyZihao / ChatTTS_Tutorials
Step-by-step Jupyter notebook tutorials for ChatTTS
☆165Updated last year
jianchang512 / cosyvoice-api
一个用于CosyVoice的api接口项目
☆304Updated 6 months ago
Henry-23 / VideoChat
实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，无须训练，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, su…
☆1,041Updated 4 months ago
HumanAIGC-Engineering / OpenAvatarChat
☆1,670Updated this week
jianchang512 / gptsovits-api
适用于 GPT-SoVITS 的api调用接口
☆298Updated last year
78 / xiaozhi
Build your own AI friend
☆652Updated 2 months ago
pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆332Updated 4 months ago
ultrasev / stream-whisper
基于 faster-whisper 的伪实时语音转写服务
☆224Updated 3 months ago
BiboyQG / bob-cosyvoice
A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.
☆39Updated 11 months ago
HuiResearch / FlashTTS
基于SparkTTS、OrpheusTTS等模型，提供高质量中文语音合成与声音克隆服务。
☆504Updated 2 months ago
kleinlee / DH_live
每个人都能用的数字人
☆1,601Updated 2 months ago
FireRedTeam / FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,235Updated 4 months ago
aliyun / alibabacloud-bailian-speech-demo
Sample Repository for the AlibabaCloud Bailian Speech SDK
☆247Updated last week
Lucky-183 / PI-Assistant
基于树莓派和GPT实现的多功能语音家庭助手 A multifunctional voice home assistant based on Raspberry Pi and GPT
☆238Updated 4 months ago
ben0oil1 / GPT-SoVITS-Server
【脱离复杂的环境配置和整合包，极简配置推理服务】从GPT-SoVITS项目里面提取出来的，纯粹的推理服务方案。
☆295Updated last year
swordswind / ai_virtual_mate_web
AI虚拟伙伴Web版
☆523Updated 3 weeks ago
libukai / Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
☆1,749Updated last year
lukeewin / AudioSeparationGUI
这是一款基于FunASR实现的说话人分离的GUI程序
☆112Updated 3 weeks ago
AlfreScarlet / MoeChat
一个超低延迟的基于GPT-SoVITS语音合成的语音交互系统
☆117Updated this week
YUANZHUO-BNU / metahuman_overview
数字人资料整理
☆940Updated 7 months ago
xiciliu / Awesome-ChatTTS-2
官方推荐的 ChatTTS 最佳入门指南，整理和汇总了常见问题和相关资源
☆100Updated last year
TOM88812 / xiaozhi-android-client
一个基于小智、xiaozhi-server的Android、IOS语音对话应用,支持实时语音交互和文字对话。现在是flutter版本，打通IOS、Android端。请同志们动动小手，点点小星星，予以鼓励。
☆931Updated 2 weeks ago
FunAudioLLM / FunAudioLLM-APP
☆365Updated last year
Tele-AI / TeleSpeech-ASR
☆740Updated last year
qi-hua / async_cosyvoice
使用vllm加速cosyvoice2的推理
☆386Updated 3 months ago
Kedreamix / Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…
☆2,824Updated 5 months ago
latiaoge / AI-Sphere-Butler
终极愿景：目标是创造一个全方位服务于用户全场景的 AI 全能管家AGI—“小粒”。除了不具备物理形态外，“小粒”将提供与远程视频通话中的真人几乎无异的体验，具备思考、情感交流、视觉、听觉以及模拟触觉反馈等能力，并能够游走在任何家庭、车辆等场景显示设备上自由与人交互。功能覆盖…
☆120Updated last month