zycv / awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
☆253Updated 2 years ago
Alternatives and similar repositories for awesome-keyword-spotting:
Users that are interested in awesome-keyword-spotting are comparing it to the libraries listed below
- Towards hot directions in industrial end to end speech recognition☆327Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆103Updated 2 years ago
- Chinese keyword spotting model using LSTM RNN☆172Updated 6 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆195Updated 3 weeks ago
- Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices☆221Updated last year
- A CRF-based ASR Toolkit☆328Updated 6 months ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆98Updated 2 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆103Updated 2 years ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆511Updated last week
- Tools for Speech Enhancement integrated with Kaldi☆409Updated last year
- Moved to https://github.com/k2-fsa/icefall☆144Updated 2 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆375Updated 2 years ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"☆195Updated 10 months ago
- Kaldi model converter to ONNX☆240Updated 2 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆338Updated 4 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆172Updated 2 months ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆311Updated 4 years ago
- ☆122Updated 4 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆375Updated last year
- Voice Activity Detection (VAD) using deep learning.☆194Updated 5 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆127Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆358Updated last year
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆32Updated 4 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆133Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- Variational Bayes HMM over x-vectors diarization☆266Updated last year
- A pure python module for reading and writing kaldi ark files☆252Updated last year
- ☆142Updated 4 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆117Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆276Updated last year