On-device voice activity detection (VAD) powered by deep learning
☆262Jun 18, 2026Updated this week
Alternatives and similar repositories for cobra
Users that are interested in cobra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- On-device noise suppression powered by deep learning☆90Jun 10, 2026Updated last week
- Voice activity engine benchmark framework☆23Jan 14, 2026Updated 5 months ago
- On-device speech-to-text engine powered by deep learning☆482Updated this week
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- On-device speaker recognition engine powered by deep learning☆49Jun 10, 2026Updated last week
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆30Jul 21, 2024Updated last year
- On-device speaker diarization powered by deep learning☆73Jun 10, 2026Updated last week
- On-device streaming text-to-speech engine powered by deep learning☆139Jun 10, 2026Updated last week
- Voice Activity Detection based on Deep Learning & TensorFlow☆372May 29, 2026Updated 2 weeks ago
- On-device Speech-to-Intent engine powered by deep learning☆703Updated this week
- A curated list of awesome voice activity detection☆75Nov 22, 2024Updated last year
- benchmark for Speech-to-Intent engines☆18Mar 27, 2026Updated 2 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆9,313Mar 26, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆19Jul 23, 2025Updated 10 months ago
- On-device streaming speech-to-text engine powered by deep learning☆664Jun 10, 2026Updated last week
- Lightweight python library for speaker diarization in real time implemented in pytorch☆11Oct 12, 2022Updated 3 years ago
- On-device LLM Inference Powered by X-Bit Quantization☆313Jun 10, 2026Updated last week
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆466Jun 3, 2020Updated 6 years ago
- Python interface to the WebRTC Voice Activity Detector☆2,489Jul 4, 2024Updated last year
- Speaker diarization benchmark framework☆40Jun 10, 2026Updated last week
- Hotword Detection (Wake Word Detection) Android library and sample codes☆11Apr 9, 2018Updated 8 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆158Oct 26, 2021Updated 4 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆17Nov 25, 2024Updated last year
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆597Apr 2, 2024Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- The rag pipeline for optimizing dynamic data editing.☆23Oct 30, 2025Updated 7 months ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 6 years ago
- On-device wake word detection powered by deep learning☆4,856Jun 10, 2026Updated last week
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- an Audio-Visual Voice Activity Detection using Deep Learning☆52Apr 7, 2019Updated 7 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- A library of speech gadgets.☆15Oct 15, 2022Updated 3 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆20Apr 22, 2025Updated last year
- A java wrapper around the WebRTC Voice Activity Detection library☆67Jul 7, 2021Updated 4 years ago
- Script to generate VAD dataset used in Asteroid recipe☆21Sep 30, 2021Updated 4 years ago