Picovoice / cobraView external linksLinks
On-device voice activity detection (VAD) powered by deep learning
☆242Jan 22, 2026Updated 3 weeks ago
Alternatives and similar repositories for cobra
Users that are interested in cobra are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆82Jan 22, 2026Updated 3 weeks ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- On-device speech-to-text engine powered by deep learning☆471Feb 7, 2026Updated last week
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 weeks ago
- On-device speaker diarization powered by deep learning☆66Jan 22, 2026Updated 3 weeks ago
- A curated list of awesome voice activity detection☆71Nov 22, 2024Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlow☆371Mar 24, 2023Updated 2 years ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,125Dec 30, 2025Updated last month
- ☆17Jul 23, 2025Updated 6 months ago
- On-device streaming text-to-speech engine powered by deep learning☆128Jan 22, 2026Updated 3 weeks ago
- On-device Speech-to-Intent engine powered by deep learning☆699Updated this week
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Dec 12, 2024Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,443Jul 4, 2024Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆456Jun 3, 2020Updated 5 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆578Apr 2, 2024Updated last year
- ☆16Dec 18, 2023Updated 2 years ago
- On-device streaming speech-to-text engine powered by deep learning☆655Feb 5, 2026Updated last week
- On-device speaker recognition engine powered by deep learning☆41Jan 22, 2026Updated 3 weeks ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆19Apr 22, 2025Updated 9 months ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 5 years ago
- 🐸TTS recipes for different datasets☆86Jul 26, 2022Updated 3 years ago
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆160Oct 26, 2021Updated 4 years ago
- BioVoice: a multipurpose tool for voice analysis☆11Nov 13, 2020Updated 5 years ago
- Predicts the level of noise and reverberation on your audiofiles☆177Jun 17, 2025Updated 7 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 2 months ago
- Voice Activity Detection (VAD) using deep learning.☆204Oct 14, 2019Updated 6 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Jul 25, 2024Updated last year
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- The rag pipeline for optimizing dynamic data editing.☆19Oct 30, 2025Updated 3 months ago
- ☆13Sep 25, 2024Updated last year
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Streamlit app to visualize and edit TTS datasets☆15Dec 15, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- A java wrapper around the WebRTC Voice Activity Detection library☆66Jul 7, 2021Updated 4 years ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆43Mar 23, 2022Updated 3 years ago
- WebRTC-based Voice Activity Detection library☆133Oct 5, 2021Updated 4 years ago