Utilizes ONNX Runtime for speech activity detection.
☆42Dec 10, 2025Updated 2 months ago
Alternatives and similar repositories for Voice-Activity-Detection-VAD-ONNX
Users that are interested in Voice-Activity-Detection-VAD-ONNX are comparing it to the libraries listed below
Sorting:
- Utilizes ONNX Runtime for audio denoising.☆116Dec 27, 2025Updated 2 months ago
- Transcribe subtitles and translate them offline with ease.☆40Jan 10, 2026Updated last month
- Running the F5-TTS by ONNX Runtime standalone with GUI☆24Dec 10, 2024Updated last year
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆25Sep 12, 2024Updated last year
- Running the F5-TTS by ONNX Runtime☆191Jan 7, 2026Updated 2 months ago
- ☆24Jan 5, 2026Updated 2 months ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- PASE: Phonologically Anchored Speech Enhancer☆43Dec 10, 2025Updated 2 months ago
- 目标检测算法主要包括:两类two-stage和one-stage 一类是two-stage,two-stage检测算法将检测问题划分为两个阶段,首先产生候选区域(region proposals),然后对候选区域分类(一般还需要对位置精修),这一类的典型代表是R-CNN…☆15Sep 5, 2021Updated 4 years ago
- ☆23Jul 17, 2024Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Oct 10, 2024Updated last year
- ☆14May 21, 2024Updated last year
- Benchmarking different VAD models on AVA-Speech dataset☆18May 21, 2023Updated 2 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- 基于ultralytics训练的行人跌倒检测模型☆19Jul 10, 2023Updated 2 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- 具有讯飞在线语音合成功能的虚幻引擎插件☆22Jul 5, 2024Updated last year
- Demonstration of running a native LLM on Android device.☆232Feb 28, 2026Updated last week
- 小智机器人服务端☆18Mar 25, 2025Updated 11 months ago
- CNN学生行为识别☆25Mar 8, 2022Updated 4 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- 基于MCP协议和LangChain框架实现的企业级AI多Agent多模态系统,包含RAG技术增强的知识检索能力。☆39Mar 10, 2025Updated 11 months ago
- YOLOv8安全帽工作服检测☆12Oct 13, 2023Updated 2 years ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆575Jan 18, 2026Updated last month
- ICSD Dataset☆41Jun 11, 2025Updated 8 months ago
- Model and application for deepfake detection using a hybrid approach (spatial + frequency-based)☆48Jan 8, 2026Updated 2 months ago
- 使用django+pyecharts+PP-Human开发的动态数据大屏, 有人流数据的采集入库, 打架、摔倒等事件警报,口罩检测等实用功能。边缘端版本使用onnx推理提升效率,服务端版本支持视频流推拉☆33May 3, 2023Updated 2 years ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆74Jan 14, 2026Updated last month
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆153Apr 29, 2025Updated 10 months ago
- ☆12Nov 23, 2021Updated 4 years ago
- 数字人授课录制系统——全新的微课视频的生成方案——UI☆44Jan 18, 2025Updated last year
- 基于paddlex目标检测的工业场景下违规使用手机识别。☆11Jun 11, 2022Updated 3 years ago
- 打架斗殴暴力行为检测系统源码和数据集:改进y打架斗殴暴力行为检测系统源码和数据集:改进yolo11-CSP-EDLANolo11-CSP-EDLAN☆13Nov 20, 2024Updated last year
- 基于改进YOLOv7&OpenCV的行人过马路速度与交通灯实时监测系统(源码&教程)☆11Dec 4, 2023Updated 2 years ago
- A simple template for your own self-hosted CV website☆12May 5, 2024Updated last year
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Oct 11, 2024Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- pre-process script for timit data for dnn-aec works☆36Mar 3, 2022Updated 4 years ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆41Jul 17, 2023Updated 2 years ago