Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆190Updated this week
Alternatives and similar repositories for cobra:
Users who are interested in cobra are comparing it to the libraries listed below
- Reproducible experimental protocols for multimedia (audio, video, text) database☆92Updated this week
- ONNX Inference of Pyannote Segmentation☆81Updated 3 weeks ago
- On-device noise suppression powered by deep learning☆64Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Voice Activity Detection (VAD) using deep learning.☆193Updated 5 years ago
- Open models for Coqui STT☆127Updated last year
- On-device speaker diarization powered by deep learning☆33Updated this week
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆348Updated 10 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆295Updated 2 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆202Updated 5 months ago
- Python bindings of WebRTC Audio Processing☆180Updated 4 months ago
- ☆38Updated 11 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆391Updated 2 months ago
- 🐸STT integration examples☆122Updated 2 years ago
- Onnx wrapper for espnet inference model☆158Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆90Updated 8 months ago
- Predicts the level of noise and reverberation in your audio files☆143Updated 7 months ago
- Variational Bayes HMM over x-vectors diarization☆260Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆129Updated last month
- Tools for Speech Enhancement integrated with Kaldi☆405Updated last year
- DeepSpeech based forced alignment tool☆235Updated 4 years ago
- Diarization scoring tools.☆232Updated last year
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆151Updated 7 months ago
- Wav2Keyword is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands dataset V1 and V2.☆102Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆293Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆144Updated 8 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- This repository contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.☆288Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆235Updated 5 months ago
- An automatic speech recognition API☆48Updated this week