Picovoice / voice-activity-benchmark
Voice activity engine benchmark framework
☆13Updated 2 years ago
Alternatives and similar repositories for voice-activity-benchmark:
Users who are interested in voice-activity-benchmark are comparing it to the libraries listed below.
- ☆11Updated 3 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- ☆21Updated last week
- WeNet online decoding demo☆29Updated 3 years ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆65Updated 2 years ago
- ☆56Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆64Updated 5 months ago
- Colab notebooks for Next-gen Kaldi☆26Updated this week
- Kaldi-compatible online fbank extractor without external dependencies☆87Updated 2 months ago
- Convert Kaldi feature extraction and nnet3 models into TensorFlow Lite models. Currently aimed at converting Kaldi's x-vector models and …☆20Updated 2 years ago
- In this repository, I try to combine k2 with SpeechBrain to decode well and quickly.☆16Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- ONNX wrapper for ESPnet inference model☆161Updated 4 months ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆24Updated 7 months ago
- ☆31Updated 10 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆47Updated 7 months ago
- A simple command line tool to calculate WER for ASR.☆14Updated 4 months ago
- Online streaming speaker change detection model in PyTorch☆37Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- ☆9Updated 2 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM☆39Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- How to use our public wav2vec2 age and gender model☆35Updated last year
- Clustering-based methods for overlapping diarization☆75Updated last year
- ☆38Updated this week
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆38Updated last year
- Python wrapper for Kaldi's native I/O☆27Updated last month
- Python bindings of speexdsp noise suppression library☆36Updated 2 years ago