Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech
☆16Sep 20, 2024Updated last year
Alternatives and similar repositories for vadc
Users that are interested in vadc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23May 11, 2024Updated 2 years ago
- This is a depth-anything-v2 onnxruntime inference by cpp☆15Sep 2, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- torch7 wrapper for knn CUDA code☆10Dec 1, 2014Updated 11 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated 2 years ago
- mnn tts demo.☆19May 7, 2025Updated last year
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆17Nov 25, 2024Updated last year
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 4 years ago
- DBT spectral analysis scripts for matlab☆10May 27, 2018Updated 8 years ago
- mnn asr demo.☆27Mar 24, 2025Updated last year
- An algorithm for the rapid evaluation of Bessel functions based on precomputed expansions.☆11Apr 16, 2018Updated 8 years ago
- A simple VAD method☆11May 27, 2019Updated 7 years ago
- demos using speex☆12Apr 20, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆35Apr 14, 2026Updated last month
- Complex Bessel functions☆13Jun 26, 2019Updated 6 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- Tflite VX Delegate i.MX Machine Learning☆12Mar 8, 2026Updated 3 months ago
- 使用ONNXRuntime部署Detic检测2万1千种类别的物体,包含C++和Python两个版本的程序☆17Aug 29, 2023Updated 2 years ago
- Bag-of-Features Acoustic Event Detection☆14Oct 5, 2016Updated 9 years ago
- Few-Shot Keyword Spotting☆73Apr 11, 2021Updated 5 years ago
- ☆14Jun 19, 2019Updated 6 years ago
- Matlab Mex implementation☆11May 2, 2016Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Jun 20, 2019Updated 6 years ago
- ☆13Jan 10, 2017Updated 9 years ago
- A Long Short Term Memory neural network for time series prediction. Memory blocks contain one memory cell in each. Weights for the networ…☆15Sep 3, 2018Updated 7 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11May 14, 2026Updated 3 weeks ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- Caffe version of code for our paper "Joint unsupervised learning of deep representations and image clusters"☆16Jul 4, 2017Updated 8 years ago
- Noise cancellation, suppression☆13Apr 8, 2019Updated 7 years ago
- ☆12Oct 7, 2018Updated 7 years ago
- 使用ONNXRuntime部署DeDoDe:"局部特征匹配:检测,不要描述——描述,不要检测"。依然是C++和Python两个版本的程序☆23Dec 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- ☆10Jul 18, 2024Updated last year
- 📄Source code variable naming using a seq2seq architecture☆10Mar 19, 2020Updated 6 years ago
- Keyword Spotting for detecting a word in an audio file☆17Jul 21, 2019Updated 6 years ago
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆31Oct 3, 2023Updated 2 years ago
- A tutorial on the delay and sum beamformer for microphone arrays☆18Jun 9, 2017Updated 9 years ago