henttttai / voice-to-voice-llm-structureLinks

自用，语音到文本用的sencevoice，llm部分基于ollama的API调用，文本到语音用的cosyvoice，实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。

☆9

Alternatives and similar repositories for voice-to-voice-llm-structure

Users that are interested in voice-to-voice-llm-structure are comparing it to the libraries listed below

Sorting:

RemSynch / SenseVoice-Real-Time
简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
☆24Updated 9 months ago
season-studio / MeloTTS-ONNX
An implementation of MeloTTS by onnxruntime
☆23Updated 8 months ago
DakeQQ / Voice-Activity-Detection-VAD-ONNX
Utilizes ONNX Runtime for speech activity detection.
☆25Updated 2 weeks ago
xinhecuican / QSmartAssistant
一个模块化，全过程可离线，低占用率的对话机器人/智能音箱
☆88Updated 3 months ago
lovemefan / paraformer.cpp
Port of Funasr's Paraformer model in C/C++
☆32Updated last year
FeiGeChuanShu / FunASR-demo-ncnn
some ncnn demos of FunASR
☆25Updated 9 months ago
wangzhaode / mnn-asr
mnn asr demo.
☆20Updated 3 months ago
Tzenthin / wenet_mnn
语音识别模型pytorch转ONNX转MNN，C++实现部署
☆68Updated 2 years ago
jundaychan / funasr-fastapi
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆12Updated 10 months ago
hpc203 / Real-Time-Frame-Interpolation-onnxrun
使用onnxruntime部署实时视频帧插值，包含C++和Python两个版本的程序
☆25Updated last year
chenyangMl / keyword-spot
端到端语音唤醒工具箱，从模型训练到模型推理。
☆117Updated 9 months ago
lovemefan / SenseVoice-python
SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime
☆95Updated 9 months ago
peilongchencc / My-FunASR
基于FunASR实现语音识别，包含常规版和ONNX版(推荐)。
☆41Updated 8 months ago
RapidAI / RapidTTS
A cross platform implementation of Text-to-Speech based on ONNXRuntime.
☆32Updated 2 years ago
DakeQQ / Audio-Denoiser-ONNX
Utilizes ONNX Runtime for audio denoising.
☆55Updated last week
lovemefan / fsmn-vad
A enterprise-grade Voice Activity Detector from modelscope and funasr.
☆102Updated 2 years ago
huakunyang / SummerAsr
SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…
☆95Updated 6 months ago
apinge / MeloTTS.cpp
A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.
☆66Updated 2 months ago
DakeQQ / Automatic-Speech-Recognition-ASR-ONNX
Utilizes ONNX Runtime to transcribe audio into text.
☆35Updated last week
WGS-note / F5_TTS_Faster
F5-TTS 推理加速，速度提升约4倍！
☆96Updated 5 months ago
huahuahuage / Bert-VITS2-Speech
Bert-VITS2 onnx推理版本
☆42Updated last year
jianchang512 / sense-api
用于SenseVoice的api项目，输出带时间戳字幕
☆36Updated 8 months ago
yuyun2000 / SpeechDenoiser
SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…
☆78Updated 10 months ago
wangzhaode / mnn-segment-anything
segment-anything based mnn
☆35Updated last year
swordswind / cosyvoice_simple_api
CosyVoice语音合成简易API
☆11Updated 7 months ago
Xiaolong-RRL / qwen2_5_vllm_fastapi
使用FastAPI+vLLM部署Qwen2.5
☆19Updated 9 months ago
esnya / realtime-whisper
ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers
☆31Updated 6 months ago
lovemefan / paraformer-python
paraformer(chinense asr) online onnx runtime for python
☆46Updated last year
hpc203 / cv_resnet18_card_correction-opencv-dnn
使用opencv部署读光-票证检测矫正模型，包含C++和Python两个版本的程序，只依赖opencv库就能运行
☆15Updated 6 months ago
hfyydd / sherpa-onnx-server
☆24Updated 5 months ago