On-device voice activity detection (VAD) powered by deep learning
☆248Mar 26, 2026Updated 3 weeks ago
Alternatives and similar repositories for cobra
Users that are interested in cobra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- On-device noise suppression powered by deep learning☆86Updated this week
- On-device speech-to-text engine powered by deep learning☆477Apr 9, 2026Updated last week
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 months ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- On-device speaker recognition engine powered by deep learning☆42Updated this week
- On-device speaker diarization powered by deep learning☆69Updated this week
- On-device streaming text-to-speech engine powered by deep learning☆136Apr 3, 2026Updated last week
- Voice Activity Detection based on Deep Learning & TensorFlow☆370Mar 24, 2023Updated 3 years ago
- On-device Speech-to-Intent engine powered by deep learning☆698Apr 9, 2026Updated last week
- A curated list of awesome voice activity detection☆73Nov 22, 2024Updated last year
- benchmark for Speech-to-Intent engines☆17Mar 27, 2026Updated 2 weeks ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,741Mar 26, 2026Updated 3 weeks ago
- ☆17Jul 23, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- On-device streaming speech-to-text engine powered by deep learning☆661Updated this week
- Lightweight python library for speaker diarization in real time implemented in pytorch☆11Oct 12, 2022Updated 3 years ago
- On-device LLM Inference Powered by X-Bit Quantization☆309Mar 27, 2026Updated 2 weeks ago
- Python interface to the WebRTC Voice Activity Detector☆2,458Jul 4, 2024Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆461Jun 3, 2020Updated 5 years ago
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 3 months ago
- Hotword Detection (Wake Word Detection) Android library and sample codes☆11Apr 9, 2018Updated 8 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated last month
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆588Apr 2, 2024Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- The rag pipeline for optimizing dynamic data editing.☆20Oct 30, 2025Updated 5 months ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 6 years ago
- On-device wake word detection powered by deep learning☆4,782Apr 9, 2026Updated last week
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆51Apr 7, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆60Oct 22, 2025Updated 5 months ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆20Apr 22, 2025Updated 11 months ago
- A java wrapper around the WebRTC Voice Activity Detection library☆66Jul 7, 2021Updated 4 years ago
- Script to generate VAD dataset used in Asteroid recipe☆21Sep 30, 2021Updated 4 years ago