Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆219 · Updated this week
Alternatives and similar repositories for cobra
Users who are interested in cobra are comparing it to the libraries listed below.
- ONNX Inference of Pyannote Segmentation ☆92 · Updated 6 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆104 · Updated 5 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆251 · Updated 11 months ago
- A curated list of awesome voice activity detection ☆59 · Updated 7 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at… ☆422 · Updated 3 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆209 · Updated 11 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆116 · Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆149 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 · Updated last year
- Tunable pipelines ☆34 · Updated 4 months ago
- On-device noise suppression powered by deep learning ☆73 · Updated 2 weeks ago
- 🐸STT integration examples ☆129 · Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆395 · Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 · Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆318 · Updated 7 months ago
- On-device speaker diarization powered by deep learning ☆51 · Updated 2 weeks ago
- ☆40 · Updated last year
- A non-native English corpus for pronunciation scoring task ☆143 · Updated 11 months ago
- openvino version of openai/whisper ☆168 · Updated last year
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for… ☆168 · Updated last year
- ONNX wrapper for espnet inference model ☆165 · Updated 9 months ago
- Open models for Coqui STT ☆141 · Updated 2 years ago
- This repository is a collection of TTS Models in TFLite ☆195 · Updated 4 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆254 · Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 · Updated last year
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆121 · Updated 3 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆361 · Updated last year
- Voice Activity Detection (VAD) using deep learning. ☆196 · Updated 5 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆337 · Updated last year
- 🐸 - A general purpose model trainer, as flexible as it gets ☆220 · Updated last year