owenliang / wakeword-torchLinks

☆12

Alternatives and similar repositories for wakeword-torch

Users that are interested in wakeword-torch are comparing it to the libraries listed below

Sorting:

0x5446 / api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆462Updated 8 months ago
ABexit / ASR-LLM-TTS
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…
☆809Updated 3 months ago
lansinuote / Chinese_Speech_to_Text
☆17Updated last year
KMnO4-zx / extract-dialogue
从小说中提取对话数据集
☆205Updated last year
HaxxorCialtion / ASR_LLM_TTS_py
combine ASR, LLM and TTS in local development with python
☆12Updated 9 months ago
south20 / ChatGLM3_Lora_Fine-tune
本项目对ChatGLM3-6B通过多种方式微调，使模型具备落地潜质（包括但不限于客服、聊天、游戏）
☆34Updated last year
IronSpiderMan / MachineLearningPractice
机器学习实战案例，涉及机器学习、深度学习等各个方向。每个案例代码量在百行左右。
☆207Updated 3 weeks ago
Linear95 / bert-intent-slot-detector
BERT-based intent and slots detector for chatbots.
☆192Updated 4 months ago
TommyZihao / openvino_tonypi
基于OpenVINO，本地部署大模型智能体Agent，控制TonyPi人形机器人
☆137Updated 3 weeks ago
NanGePlus / GraphragTest
提供了一种gpt大模型平替解决方案实现利用非gpt大模型去使用Graphrag，支持多类型大模型如本地大模型(Ollama)、阿里云通义千问、百度文心千帆、智谱ChatGML、讯飞星火认知、Ollama、Moonshot AI、Google Gemini等。示例代码使用阿里…
☆321Updated 7 months ago
BiboyQG / bob-cosyvoice
A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.
☆37Updated 10 months ago
GuoCoder / ai-app
本项目旨在分享人工智能相关应用技术以及实战经验，包括大模型、语音合成、数字人、图像生成等。
☆246Updated 9 months ago
480284856 / AsyncAudioChat
异步语音对话组件。
☆22Updated 3 months ago
Ikaros-521 / RealtimeSTT_LLM_TTS
实时STT，连接OpenAI接口/智谱AI（流式LLM）和GPT-SOVITS/Edge-TTS，通过网页的方式，进行跨网络的服务调用，实现实时对话的效果
☆397Updated 5 months ago
emVisible / emRag
基于LangChain + Xinference + Chroma构建的本地知识库
☆11Updated 2 weeks ago
pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆297Updated 3 months ago
Tele-AI / TeleSpeech-ASR
☆720Updated last year
ultrasev / stream-whisper
基于 faster-whisper 的伪实时语音转写服务
☆217Updated 2 months ago
mzc421 / Pytorch-NLP
使用Pytorch框架对NLP方向上的文本分类、实体识别、三元组抽取做代码实战
☆187Updated last year
NanGePlus / RagLangChainTest
在本项目中模拟健康档案私有知识库构建和检索全流程，通过一份代码实现了同时支持多种大模型（如OpenAI、阿里通义千问等）的RAG（检索增强生成）功能:(1)离线步骤:文档加载->文档切分->向量化->灌入向量数据库；在线步骤:获取用户问题->用户问题向量化->检索向量数据库…
☆142Updated 9 months ago
owenliang / agent
qwen ai agent
☆134Updated last year
iamZhaoHang / VLM-ROS
为了实现真正的All in Local！我将Llava视觉大模型、QWen2.5-VL多模态大模型，以及STT和TTS模型全部部署在本地计算机上，打造了一个完全离线的机器人视觉交互系统。机器人通过摄像头感知周围环境，LLaVA和QWen2.5-VL进行视觉分析，STT进…
☆14Updated 2 months ago
RemSynch / SenseVoice-Real-Time
简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
☆24Updated 9 months ago
qi-hua / async_cosyvoice
使用vllm加速cosyvoice2的推理
☆344Updated 2 months ago
xaio6 / LabelQuick
一种快速、轻松的AI辅助标注工具LabelQuick
☆232Updated 4 months ago
TommyZihao / vlm_arm
机械臂+大模型+多模态=人机协作具身智能体
☆849Updated this week
5zjk5 / prompt-engineering
prompt 工程项目案例
☆85Updated 3 months ago
BinNong / llm-graph-builder
Neo4j graph construction from unstructured data
☆311Updated 10 months ago
wangxb96 / RAG-QA-Generator
RAG-QA-Generator 是一个用于检索增强生成（RAG）系统的自动化知识库构建与管理工具。该工具通过读取文档数据，利用大规模语言模型生成高质量的问答对（QA对），并将这些数据插入数据库中，实现RAG系统知识库的自动化构建和管理。
☆200Updated 6 months ago
lukeewin / AudioSeparationGUI
这是一款基于FunASR实现的说话人分离的GUI程序
☆95Updated 4 months ago