Uses the excellent Silero VAD with the ONNX Runtime C API for fast detection of audio segments containing speech
☆16Sep 20, 2024Updated last year
Alternatives and similar repositories for vadc
Users who are interested in vadc compare it to the libraries listed below
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23May 11, 2024Updated last year
- Convert ONNX models to plain C++ code (without dependencies)☆22Mar 27, 2023Updated 2 years ago
- Depth-Anything-V2 inference with ONNX Runtime in C++☆15Sep 2, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- torch7 wrapper for knn CUDA code☆10Dec 1, 2014Updated 11 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆54Mar 15, 2025Updated last year
- FFmpeg + NCNN + Qt video portrait matting model deployed on Intel CPU with real-time encoding and decoding☆10Apr 2, 2022Updated 3 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- mnn tts demo.☆19May 7, 2025Updated 10 months ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 3 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1☆11Aug 8, 2017Updated 8 years ago
- A simple VAD method☆11May 27, 2019Updated 6 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization an ONNX runtime port…☆30Mar 9, 2026Updated last week
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- Tflite VX Delegate i.MX Machine Learning☆11Mar 8, 2026Updated 2 weeks ago
- Deploys Detic with ONNXRuntime to detect objects across 21,000 categories, with both C++ and Python versions of the program☆17Aug 29, 2023Updated 2 years ago
- Bag-of-Features Acoustic Event Detection☆14Oct 5, 2016Updated 9 years ago
- ☆14Jun 19, 2019Updated 6 years ago
- A multilingual tool to convert PDF ebooks to audiobooks using XTTS v2 TTS model by cloning a speaker voice.☆18Jan 22, 2025Updated last year
- Matlab Mex implementation☆11May 2, 2016Updated 9 years ago
- ☆13Jun 20, 2019Updated 6 years ago
- DCASE2016 TASK1 Scene Classification☆12May 2, 2017Updated 8 years ago
- ☆13Jan 10, 2017Updated 9 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Mar 10, 2026Updated last week
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- Caffe version of code for our paper "Joint unsupervised learning of deep representations and image clusters"☆16Jul 4, 2017Updated 8 years ago
- A framework for creating voice-based agents. Integrates LLMs with speech recognition and text-to-speech☆34May 1, 2024Updated last year
- ☆12Oct 7, 2018Updated 7 years ago
- battery frame☆19Jun 8, 2018Updated 7 years ago
- Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations☆43Nov 19, 2025Updated 4 months ago
- Deploys DeDoDe with ONNXRuntime: "Local feature matching: detect, don't describe; describe, don't detect". Again provided in both C++ and Python versions☆23Dec 22, 2023Updated 2 years ago
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- 📄Source code variable naming using a seq2seq architecture☆10Mar 19, 2020Updated 6 years ago
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- Keyword Spotting for detecting a word in an audio file☆17Jul 21, 2019Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- A tutorial on the delay and sum beamformer for microphone arrays☆17Jun 9, 2017Updated 8 years ago
- a game framework. warning: wip, dev, unstable, radiation hazard, defcon 3☆24May 10, 2015Updated 10 years ago