Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆179 Updated this week
Related projects
Alternatives and complementary repositories for cobra
- ONNX Inference of Pyannote Segmentation ☆66 Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 Updated 6 months ago
- ☆34 Updated 9 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆141 Updated 6 months ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆111 Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆57 Updated last year
- C++ library for converting text to phonemes for Piper ☆89 Updated 8 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆120 Updated 2 weeks ago
- Open models for Coqui STT ☆122 Updated last year
- Predicts the level of noise and reverberation in your audio files ☆138 Updated 5 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆84 Updated last month
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆285 Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆99 Updated last year
- ONNX wrapper for ESPnet inference model ☆156 Updated last month
- 🐸STT integration examples ☆121 Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆71 Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆240 Updated last year
- Segment an audio file and obtain utterance alignments. (Python package) ☆321 Updated 6 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆322 Updated 9 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆126 Updated 3 weeks ago
- On-device noise suppression powered by deep learning ☆63 Updated last month
- Diarization scoring tools. ☆220 Updated last year
- OpenVINO version of openai/whisper ☆161 Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆200 Updated 3 months ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P… ☆187 Updated 3 weeks ago
- How to create your own model for vosk ☆64 Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆225 Updated 3 months ago
- Kaldi-compatible online fbank extractor without external dependencies ☆78 Updated 3 weeks ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆133 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆105 Updated last year