Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆208Updated this week
Alternatives and similar repositories for cobra:
Users who are interested in cobra are comparing it to the libraries listed below
- ONNX Inference of Pyannote Segmentation☆86Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- On-device noise suppression powered by deep learning☆69Updated 2 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 2 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆248Updated 9 months ago
- ☆39Updated last year
- On-device speaker diarization powered by deep learning☆44Updated last month
- A curated list of awesome voice activity detection☆50Updated 5 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆63Updated last month
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆412Updated last month
- Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech☆344Updated 2 years ago
- 🐸STT integration examples☆127Updated 2 years ago
- ONNX wrapper for espnet inference model☆162Updated 6 months ago
- Various speech datasets made available to the public☆116Updated 4 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆205Updated 9 months ago
- Voice Activity Detection (VAD) using deep learning.☆196Updated 5 years ago
- Diarization scoring tools.☆242Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- Predicts the level of noise and reverberation in your audio files☆148Updated 11 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆137Updated 4 months ago
- ☆359Updated 8 months ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands datasets V1 and V2.☆106Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆97Updated last week
- SEPIA server to support open-source speech recognition via WebSocket connection.☆126Updated 5 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆378Updated last year
- Open models for Coqui STT☆138Updated last year
- This repository is a collection of TTS Models in TFLite☆192Updated 4 years ago
- Large, modern dataset for speech recognition☆673Updated last year
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆458Updated last year