henttttai / voice-to-voice-llm-structureLinks
自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。
☆9Updated 6 months ago
Alternatives and similar repositories for voice-to-voice-llm-structure
Users that are interested in voice-to-voice-llm-structure are comparing it to the libraries listed below
Sorting:
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆24Updated 9 months ago
- An implementation of MeloTTS by onnxruntime☆23Updated 8 months ago
- Utilizes ONNX Runtime for speech activity detection.☆25Updated 2 weeks ago
- 一个模块化,全过程可离线,低占用率的对话机器人/智能音箱☆88Updated 3 months ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- some ncnn demos of FunASR☆25Updated 9 months ago
- mnn asr demo.☆20Updated 3 months ago
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆68Updated 2 years ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆12Updated 10 months ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆25Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆117Updated 9 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆95Updated 9 months ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆41Updated 8 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Utilizes ONNX Runtime for audio denoising.☆55Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆102Updated 2 years ago
- SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…☆95Updated 6 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆66Updated 2 months ago
- Utilizes ONNX Runtime to transcribe audio into text.☆35Updated last week
- F5-TTS 推理加速,速度提升约4倍!☆96Updated 5 months ago
- Bert-VITS2 onnx推理版本☆42Updated last year
- 用于SenseVoice的api项目,输出带时间戳字幕☆36Updated 8 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆78Updated 10 months ago
- segment-anything based mnn☆35Updated last year
- CosyVoice语音合成简易API☆11Updated 7 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆19Updated 9 months ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆31Updated 6 months ago
- paraformer(chinense asr) online onnx runtime for python☆46Updated last year
- 使用opencv部署读光-票证检测矫正模型,包含C++和Python两个版本的程序,只依赖opencv库就能运行☆15Updated 6 months ago
- ☆24Updated 5 months ago