DakeQQ / Voice-Activity-Detection-VAD-ONNXView external linksLinks
Utilizes ONNX Runtime for speech activity detection.
☆41Dec 10, 2025Updated 2 months ago
Alternatives and similar repositories for Voice-Activity-Detection-VAD-ONNX
Users that are interested in Voice-Activity-Detection-VAD-ONNX are comparing it to the libraries listed below
Sorting:
- Utilizes ONNX Runtime to transcribe audio into text.☆81Updated this week
- Export the STFT or ISTFT process in ONNX format.☆40Nov 21, 2025Updated 2 months ago
- Running the F5-TTS by ONNX Runtime standalone with GUI☆24Dec 10, 2024Updated last year
- PASE: Phonologically Anchored Speech Enhancer☆37Dec 10, 2025Updated 2 months ago
- Demonstration of combine YOLO and depth estimation on Android device.☆67Nov 15, 2025Updated 3 months ago
- Running the F5-TTS by ONNX Runtime☆191Jan 7, 2026Updated last month
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆21Jun 9, 2025Updated 8 months ago
- 基于飞桨pphuman中跨镜头跟踪的改进,主要是实现两路推流实时跟踪☆16Aug 14, 2023Updated 2 years ago
- ☆23Jul 17, 2024Updated last year
- ☆14May 21, 2024Updated last year
- 基于ultralytics训练的行人跌倒检测模型☆19Jul 10, 2023Updated 2 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- F5-TTS 推理加速,速度提升约4倍!☆123Jan 6, 2025Updated last year
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- 具有讯飞在线语音合成功能的虚幻引擎插件☆22Jul 5, 2024Updated last year
- 小智机器人服务端☆18Mar 25, 2025Updated 10 months ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- YOLOv8安全帽工作服检测☆12Oct 13, 2023Updated 2 years ago
- 基于MCP协议和LangChain框架实现的企业级AI多Agent多模态系统,包含RAG技术增强的知识检索能力。☆38Mar 10, 2025Updated 11 months ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆561Jan 18, 2026Updated 3 weeks ago
- The official repo of UL-UNAS, an ultra-lightweight SE model.☆108Updated this week
- ICSD Dataset☆40Jun 11, 2025Updated 8 months ago
- ☆38Jan 20, 2025Updated last year
- An example of a speech enhancement model deployed with TensorRT.☆77Mar 24, 2025Updated 10 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆71Jan 14, 2026Updated last month
- A University Level Discrete Math and Theory 2 Course (Theory of Computation)☆13Dec 9, 2025Updated 2 months ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆151Apr 29, 2025Updated 9 months ago
- 打架斗殴暴力行为检测系统源码和数据集:改进y打架斗殴暴力行为检测系统源码和数据集:改进yolo11-CSP-EDLANolo11-CSP-EDLAN☆13Nov 20, 2024Updated last year
- 数字人授课录制系统——全新的微课视频的生成方案——UI☆44Jan 18, 2025Updated last year
- 基于paddlex目标检测的工业场景下违规使用手机识别。☆11Jun 11, 2022Updated 3 years ago
- A simple template for your own self-hosted CV website☆12May 5, 2024Updated last year
- A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)☆71May 27, 2025Updated 8 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Oct 11, 2024Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- Generate synthetic wind noise signals based on a wind speed profile (Python)☆48Apr 23, 2024Updated last year
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆40Jul 17, 2023Updated 2 years ago
- pre-process script for timit data for dnn-aec works☆36Mar 3, 2022Updated 3 years ago
- MVDR beamformer written in python☆10Jul 2, 2021Updated 4 years ago