On-device voice activity detection (VAD) powered by deep learning
☆245Mar 2, 2026Updated this week
Alternatives and similar repositories for cobra
Users that are interested in cobra are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆83Updated this week
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- On-device speech-to-text engine powered by deep learning☆472Updated this week
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated last month
- A curated list of awesome voice activity detection☆73Nov 22, 2024Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlow☆371Mar 24, 2023Updated 2 years ago
- On-device speaker diarization powered by deep learning☆69Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,279Feb 24, 2026Updated last week
- ☆17Jul 23, 2025Updated 7 months ago
- On-device Speech-to-Intent engine powered by deep learning☆698Updated this week
- On-device streaming text-to-speech engine powered by deep learning☆131Updated this week
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated 2 weeks ago
- Python interface to the WebRTC Voice Activity Detector☆2,446Jul 4, 2024Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆457Jun 3, 2020Updated 5 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆579Apr 2, 2024Updated last year
- ☆16Dec 18, 2023Updated 2 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- On-device streaming speech-to-text engine powered by deep learning☆660Updated this week
- On-device speaker recognition engine powered by deep learning☆41Updated this week
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 6 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆20Apr 22, 2025Updated 10 months ago
- 🐸TTS recipes for different datasets☆86Jul 26, 2022Updated 3 years ago
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- BioVoice: a multipurpose tool for voice analysis☆11Nov 13, 2020Updated 5 years ago
- Predicts the level of noise and reverberation on your audiofiles☆179Jun 17, 2025Updated 8 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 3 months ago
- Voice Activity Detection (VAD) using deep learning.☆204Oct 14, 2019Updated 6 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Jul 25, 2024Updated last year
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆13Sep 25, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Streamlit app to visualize and edit TTS datasets☆15Dec 15, 2021Updated 4 years ago
- The rag pipeline for optimizing dynamic data editing.☆20Oct 30, 2025Updated 4 months ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆43Mar 23, 2022Updated 3 years ago
- 🐸STT integration examples☆130Sep 23, 2022Updated 3 years ago