henttttai / voice-to-voice-llm-structure
For personal use. Speech-to-text uses SenseVoice, the LLM part is based on Ollama API calls, text-to-speech uses CosyVoice, and real-time voice input follows https://github.com/ABexit/ASR-LLM-TTS.
☆9 Updated 5 months ago
Alternatives and similar repositories for voice-to-voice-llm-structure
Users who are interested in voice-to-voice-llm-structure are comparing it to the repositories listed below.
- Utilizes ONNX Runtime for speech activity detection. ☆24 Updated this week
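- Speech recognition based on FunASR, with a standard version and an ONNX version (recommended). ☆39 Updated 7 months ago
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, running on onnxruntime ☆94 Updated 8 months ago
- The Bert-VITS2 project has many bugs and unfriendly tutorials. This project fixes as many Bert-VITS2 bugs as possible and supports one-click training. Only 50 utterances from the target speaker are needed to obtain a stable, fast TTS model. ☆55 Updated 2 months ago
- F5-TTS inference acceleration, roughly 4x faster! ☆92 Updated 5 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR). ☆69 Updated 2 months ago
- A small project that simply combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription ☆23 Updated 8 months ago
- A modular, low-resource conversational robot / smart speaker whose entire pipeline can run offline ☆84 Updated 3 months ago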
- Utilizes ONNX Runtime to transcribe audio into text. ☆30 Updated this week
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages. ☆62 Updated last month
- Python Wrapper of Silero VAD ☆54 Updated 3 weeks ago
- An implementation of MeloTTS with onnxruntime ☆23 Updated 7 months ago
- A streaming speech recognition project focused on Faster Whisper. ☆16 Updated 8 months ago
- Real-time video frame interpolation deployed with onnxruntime, with both C++ and Python versions ☆25 Updated last year
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR. ☆100 Updated 2 years ago
- Deploying the DeepSeek-R1 distilled 1.5B model on Android phones ☆21 Updated 4 months ago
- Deploying Qwen2.5 with FastAPI + vLLM ☆18 Updated 8 months ago
- Asynchronous voice dialogue component. ☆21 Updated 2 months ago
- Port of FunASR's Paraformer model in C/C++ ☆31 Updated 11 months ago
- Paraformer (Chinese ASR) online ONNX Runtime for Python ☆44 Updated last year
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS. ☆18 Updated 6 months ago
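- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni that uses an online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing. ☆54 Updated 3 weeks ago
- Simple API for CosyVoice speech synthesis ☆11 Updated 7 months ago
- Front-end project for ASR_LLM_TTS ☆14 Updated 6 months ago
- Audio tools for Python ☆14 Updated 6 months ago
- End-to-end voice wake-up toolbox, from model training to model inference. ☆116 Updated 9 months ago
- A simple API version of FunASR speech-to-text (FunASR + FastAPI), easy to deploy on a server ☆12 Updated 9 months ago
- Paraformer web server built with Sanic ☆24 Updated 2 years ago
- Speech recognition model converted from PyTorch to ONNX to MNN, deployed in C++ ☆67 Updated 2 years ago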
- Just a suturing monster project. ☆41 Updated last year