henttttai / voice-to-voice-llm-structure
For personal use. Speech-to-text uses SenseVoice, the LLM part is based on Ollama API calls, text-to-speech uses CosyVoice, and real-time voice input is adapted from https://github.com/ABexit/ASR-LLM-TTS.
☆9 Updated 6 months ago
Alternatives and similar repositories for voice-to-voice-llm-structure
Users that are interested in voice-to-voice-llm-structure are comparing it to the libraries listed below
- Deploys Qwen2.5 with FastAPI + vLLM ☆21 Updated 9 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages. ☆67 Updated 2 weeks ago
- Utilizes ONNX Runtime for speech activity detection. ☆25 Updated this week
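- A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription ☆29 Updated 9 months ago
- A modular, fully offline-capable, low-resource conversational robot / smart speaker ☆95 Updated 4 months ago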
- ☆15 Updated this week
- Lightning-responsive CosyVoice2 streaming API based on FastAPI. ☆11 Updated last month
- Combines ASR, LLM, and TTS for local development with Python ☆12 Updated 9 months ago
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR. ☆105 Updated 2 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR). ☆74 Updated 3 months ago
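- An end-to-end voice wake-up toolkit, covering model training through model inference. ☆121 Updated 10 months ago
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, running on onnxruntime ☆96 Updated 9 months ago
- Deploys the Duguang (读光) ticket and certificate detection and rectification model with OpenCV, in both C++ and Python versions; runs with OpenCV as the only dependency ☆18 Updated 6 months ago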
- Accelerating GOT-OCRv2 with VLLM ☆9 Updated 8 months ago
- ☆21 Updated 4 months ago
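- Deploys YOLO11 table detection with OpenCV. It is the 2nd-place solution from the table-detection track of the Baidu Netdisk AI Competition and comprises three modules: table bounding-box detection, table corner-point detection, and table orientation classification. As before, I wrote both C++ and Python versions ☆11 Updated 7 months ago
- Speech recognition built on FunASR, with a standard version and an ONNX version (recommended). ☆41 Updated 9 months ago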
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆130 Updated 2 months ago
- Solving puzzles with RWKV locally in your browser. ☆12 Updated 4 months ago
- Deploys Gaze-LLE gaze target estimation with onnxruntime, in both C++ and Python versions ☆14 Updated 5 months ago
- ☆201 Updated 9 months ago
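- The Bert-VITS2 project has many bugs and unfriendly tutorials. This project fixes as many Bert-VITS2 bugs as possible and supports one-click training. Only 50 utterances from the target speaker are needed to obtain a stable, fast TTS model. ☆62 Updated 4 months ago
- An enterprise-grade Chinese-English code-switching punctuator from FunASR. ☆24 Updated last year
- F5-TTS inference acceleration, roughly 4x faster! ☆100 Updated 6 months ago
- An asynchronous voice dialogue component. ☆24 Updated 4 months ago
- A complete local deployment of the ASR(K2)-NLP(Rasa,Spacy)-LLM(Chatglm2)-TTS(Vits) pipeline ☆143 Updated 3 months ago
- A simple API for CosyVoice speech synthesis ☆11 Updated 8 months ago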
- Utilizes ONNX Runtime to transcribe audio into text. ☆41 Updated this week
- Audio tools for Python ☆15 Updated 8 months ago
- Python Wrapper of Silero VAD ☆56 Updated 2 months ago