Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆206Updated this week
Alternatives and similar repositories for cobra:
Users that are interested in cobra are comparing it to the libraries listed below
- On-device speaker diarization powered by deep learning☆43Updated 3 weeks ago
- Onnx wrapper for espnet infrernce model☆162Updated 6 months ago
- ☆39Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆147Updated 11 months ago
- 🐸STT integration examples☆127Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆99Updated 2 months ago
- On-device noise suppression powered by deep learning☆69Updated 3 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- ONNX Inference of Pyannote Segmentation☆84Updated 3 months ago
- Variational Bayes HMM over x-vectors diarization☆268Updated last year
- Voice Activity Detection (VAD) using deep learning.☆195Updated 5 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆247Updated last year
- Open models for Coqui STT☆136Updated last year
- Diarization scoring tools.☆240Updated 2 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆204Updated 8 months ago
- Tunable pipelines☆32Updated last month
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆245Updated 8 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆367Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆410Updated last week
- Predicts the level of noise and reverberation on your audiofiles☆148Updated 10 months ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆119Updated 3 years ago
- Various speech datasets made available to the public☆116Updated 4 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆125Updated 5 months ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆92Updated last week
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- A curated list of awesome voice activity detection☆48Updated 4 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆334Updated 10 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆62Updated 2 weeks ago