On-device voice activity detection (VAD) powered by deep learning
☆250Apr 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for cobra
Users that are interested in cobra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- On-device noise suppression powered by deep learning☆87Apr 17, 2026Updated 3 weeks ago
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 months ago
- On-device speech-to-text engine powered by deep learning☆481Updated this week
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- On-device speaker recognition engine powered by deep learning☆42Apr 28, 2026Updated last week
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆30Jul 21, 2024Updated last year
- On-device speaker diarization powered by deep learning☆71Apr 17, 2026Updated 3 weeks ago
- On-device streaming text-to-speech engine powered by deep learning☆139Apr 17, 2026Updated 3 weeks ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆371Mar 24, 2023Updated 3 years ago
- On-device Speech-to-Intent engine powered by deep learning☆699Apr 17, 2026Updated 3 weeks ago
- A curated list of awesome voice activity detection☆74Nov 22, 2024Updated last year
- benchmark for Speech-to-Intent engines☆18Mar 27, 2026Updated last month
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,993Mar 26, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Jul 23, 2025Updated 9 months ago
- On-device streaming speech-to-text engine powered by deep learning☆662Apr 18, 2026Updated 3 weeks ago
- Lightweight python library for speaker diarization in real time implemented in pytorch☆11Oct 12, 2022Updated 3 years ago
- On-device LLM Inference Powered by X-Bit Quantization☆311Apr 29, 2026Updated last week
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆464Jun 3, 2020Updated 5 years ago
- Python interface to the WebRTC Voice Activity Detector☆2,472Jul 4, 2024Updated last year
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 4 months ago
- Hotword Detection (Wake Word Detection) Android library and sample codes☆11Apr 9, 2018Updated 8 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆16Nov 25, 2024Updated last year
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆591Apr 2, 2024Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- The rag pipeline for optimizing dynamic data editing.☆21Oct 30, 2025Updated 6 months ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 6 years ago
- On-device wake word detection powered by deep learning☆4,805Apr 17, 2026Updated 3 weeks ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- an Audio-Visual Voice Activity Detection using Deep Learning☆52Apr 7, 2019Updated 7 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆60Oct 22, 2025Updated 6 months ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆20Apr 22, 2025Updated last year
- A java wrapper around the WebRTC Voice Activity Detection library☆67Jul 7, 2021Updated 4 years ago