Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech
☆16Sep 20, 2024Updated last year
Alternatives and similar repositories for vadc
Users that are interested in vadc are comparing it to the libraries listed below
Sorting:
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23May 11, 2024Updated last year
- Convert ONNX models to plain C++ code (without dependencies)☆22Mar 27, 2023Updated 2 years ago
- This is a depth-anything-v2 onnxruntime inference by cpp☆15Sep 2, 2024Updated last year
- ffmpeg+NCNN+QT视频人像抠图模型Intel CPU部署实时编解码☆10Apr 2, 2022Updated 3 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated 3 weeks ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- mnn tts demo.☆19May 7, 2025Updated 9 months ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 3 years ago
- 使用ONNXRuntime部署Detic检测2万1千种类别的物体,包含C++和Python两个版本的程序☆17Aug 29, 2023Updated 2 years ago
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- 使用QT界面库,FFMPEG 做解码库,用于播放流媒体以及本地视频。本项目有一个特色就是透明视频的叠加。在流媒体的显示上覆盖一层本地视频。☆22May 29, 2018Updated 7 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆53Mar 15, 2025Updated 11 months ago
- mnn asr demo.☆25Mar 24, 2025Updated 11 months ago
- 使用ONNXRuntime部署DeDoDe:"局部特征匹配:检测,不要描述——描述,不要检测"。依然是C++和Python两个版本的程序☆23Dec 22, 2023Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆30Aug 31, 2025Updated 6 months ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- 使用onnxruntime部署LYT-Net轻量级低光图像增强,包含C++和Python两个版本的程序☆29Jun 11, 2024Updated last year
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- A simple javascript utility library to include partial html (iframe alternate) without a framework or jQuery.☆17Oct 21, 2022Updated 3 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Aug 21, 2025Updated 6 months ago
- 小飞机翻墙教程☆24Nov 14, 2019Updated 6 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- Few-Shot Keyword Spotting☆71Apr 11, 2021Updated 4 years ago
- A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speech☆34May 1, 2024Updated last year
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆30Mar 6, 2025Updated 11 months ago
- A lightweight .NET Core console program to merge multiple TIFF files into one.☆12Jul 30, 2019Updated 6 years ago
- call rwkv v4/v5/v6/v7 raven/world/finch 1B5-14B rwkv.cpp using csharp cpu/gpu (support INT4,8,Float16,32)☆35Feb 21, 2025Updated last year
- Its a stm32f4 discovery based project. It recognizes 6 spoken words and selects appropriate output☆35Jun 25, 2017Updated 8 years ago
- The GitHub repository for the paper "Denoising Application of Magnetotelluric Low-Frequency Signal Processing"☆11Feb 22, 2023Updated 3 years ago
- GUI for GHRepoSearcher. It allows to search online repositories on github.☆10May 20, 2022Updated 3 years ago
- 使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序☆87Jan 15, 2024Updated 2 years ago
- Anthropic’s Model Context Protocol implementation for Oat++☆49Dec 13, 2024Updated last year
- ☆41May 19, 2023Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆38Aug 7, 2024Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE…☆11May 5, 2024Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year