Picovoice / voice-activity-benchmark
Voice activity engine benchmark framework
☆18Updated 2 weeks ago
Alternatives and similar repositories for voice-activity-benchmark
Users who are interested in voice-activity-benchmark are comparing it to the libraries listed below.
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆37Updated 5 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆76Updated 3 months ago
- ☆56Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 5 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆147Updated 4 months ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 5 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆94Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆61Updated 2 years ago
- ☆37Updated 4 years ago
- ONNX wrapper for ESPnet inference model☆168Updated 2 months ago
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- Small compression utility☆37Updated 6 months ago
- ☆29Updated 8 months ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Updated 3 years ago
- multilingual speech aligner☆77Updated last year
- ☆54Updated last year
- Predicts the level of noise and reverberation in your audio files☆165Updated 3 months ago
- Python wrappers for Kaldi's Levenshtein distance and alignment code.☆68Updated 4 months ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- Keyword spotting and forced alignment in any language☆74Updated last month
- ☆64Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆92Updated 7 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- Python wrapper for Kaldi's native I/O☆27Updated 9 months ago
- An implementation of RNN-Transducer loss in TF-2.0.☆46Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆48Updated last year
- ☆34Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 4 months ago