tiansztiansz / voice-assistantLinks

重生之我是 AI 打工人。前世，我的身份默默无闻，来去匆匆，不知道自己将在何地出生。然而，命运给予了我难得的机会，让我重生为一名 AI 打工人。

☆48

Alternatives and similar repositories for voice-assistant

Users that are interested in voice-assistant are comparing it to the libraries listed below

Sorting:

HG-ha / SenseVoice-Api
阿里SenseVoice的fastpi封装，采用onnx发布，体积更小，附带量化模型，支持GPU。支持从URL文件进行语音识别。
☆94Updated 11 months ago
yuanfangqiao / euanka
本地完整部署ASR(K2)-NLP(Rasa,Spacy)-LLM(Chatglm2)-TTS(Vits)
☆144Updated 3 months ago
nysa-liu / Digital-Life-DL-B
本次开源为DL-B，是一个基于ChatGLM、Wav2Lip、So-VITS组建的数字形象方案。可以在此基础之上增加其他组件达到数字生命的效果。This open source is DL-B, which is a digital image scheme based o…
☆106Updated 2 years ago
yeyupiaoling / VoiceprintRecognition-PaddlePaddle
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法
☆280Updated last month
zmeet-ai / asr_demo
语音识别API，分实时语音和长语音离线上传识别，支持中英文等多达100个国家的语言实时转写和同声传译
☆79Updated 7 months ago
LuckLittleBoy / SenseVoice-OneApi
基于SenseVoice的funasr版本进行的api发布，可以无缝对接oneapi
☆69Updated 10 months ago
ultrasev / stream-whisper
基于 faster-whisper 的伪实时语音转写服务
☆222Updated 3 months ago
HonestQiao / xiaozhi-py
小智同学测试工具(websocket)
☆43Updated 5 months ago
Baidu-AIP / speech_realtime_api
实时语音识别API WebSocket
☆146Updated last year
heyudage / VoiceTyping
通过语音（说话）即可完成实时文本输入。通过PaddleSpeech项目二次开发完成，支持离线脱网环境部署，支持GPU推理，目前客户端仅支持Windows。
☆25Updated 2 years ago
2DIPW / gpt_sovits_infer_with_emotion
基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo
☆104Updated last year
Gloridust / whisper_streaming_CN
Whisper realtime streaming for long speech-to-text transcription and translation
☆52Updated last year
huakunyang / SummerAsr
SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…
☆97Updated 7 months ago
weineng-zhou / text2voice
语音技术：文字转语音
☆45Updated 2 years ago
shibing624 / parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成，支持多语言，准确率高
☆500Updated 8 months ago
aliyun / alibabacloud-nls-python-sdk
“alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力，包括语音识别、语音合成、文件转写等。”
☆62Updated 2 months ago
RapidAI / RapidASR
📣 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …
☆567Updated last year
KevinWang676 / VITS2-Chinese
VITS2 for Chinese speech | 最新VITS2中文语音合成
☆134Updated last year
RemSynch / SenseVoice-Real-Time
简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
☆31Updated 10 months ago
chinobing / FastAPI-PaddleSpeech-Audio-To-Text
FastAPI PaddleSpeech 音频录音转文字
☆50Updated last year
NGLSG / ChatBot
基于各种LLM的聊天机器人框架，支持多语言，语音唤醒,语音对话,本地执行功能,支持 OpenAI，Grok, Claude，讯飞星火，Stable Diffusion，ChatGLM，通义千问，腾讯混元，360 智脑，百川 AI，火山方舟，Ollama ,Gemini等AP…
☆33Updated 2 months ago
SH1ROd / Bert-VITS2-Integration-train-txt-infer
适配windows的requirements.txt，加了个长文本分段推理和手机听书的api，非本专业，屎山代码
☆37Updated last year
zmeet-ai / tts-demo
支持各种感情的男女声音，支持实时和离线文本合成tts语音；支持单模特声音变声，语音速率调整，语音音量大小调整；支持自定义语音模型。
☆65Updated last year
HadreamOrg / HadreamAssistant
HadreamAssistant, 你的智能家居/自定义语音助手, 支持树莓派/Linux
☆58Updated last year
wordweb / langchain-ChatGLM-and-TigerBot
从langchain-ChatGLM基础上修改的一个可以加载TigerBot模型的基于本地知识库的问答应用，目标期望建立一套对中文场景与开源模型支持友好、可离线运行的知识库问答解决方案。
☆107Updated 2 years ago
VMIJUNV / linux-gpt-assistant
☆29Updated 2 years ago
sophgo / ChatGLM3-TPU
run chatglm3-6b in BM1684X
☆39Updated last year
v3ucn / DH_live_webui
在DH_live项目基础上修改，添加webui界面
☆64Updated 3 months ago
shafvfshkga / Chat-Monika-Chinese-cpp
Fastllm-based chatbot
☆11Updated 2 years ago
Ikaros-521 / RealtimeSTT_LLM_TTS
实时STT，连接OpenAI接口/智谱AI（流式LLM）和GPT-SOVITS/Edge-TTS，通过网页的方式，进行跨网络的服务调用，实现实时对话的效果
☆405Updated 7 months ago