This repository contains code for applying Data2Vec to pretrain the Keyword Transformer model, as described in "Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining".
☆30 · Mar 6, 2025 · Updated 11 months ago
Alternatives and similar repositories for data2vec-KWS
Users who are interested in data2vec-KWS are comparing it to the libraries listed below.
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2. ☆109 · Jan 11, 2023 · Updated 3 years ago
- Test Framework for few-shot open set KWS ☆41 · Nov 8, 2024 · Updated last year
- Continual Learning Benchmark for Spoken Keyword Spotting ☆17 · Jun 7, 2022 · Updated 3 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆185 · Dec 6, 2024 · Updated last year
- Few-Shot Keyword Spotting ☆71 · Apr 11, 2021 · Updated 4 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS" ☆23 · Mar 6, 2023 · Updated 2 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss ☆113 · Sep 14, 2022 · Updated 3 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection). ☆282 · May 23, 2022 · Updated 3 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022. ☆15 · Nov 5, 2022 · Updated 3 years ago
- Recipe for LibriPhrase ☆33 · Sep 2, 2023 · Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM ☆43 · Nov 18, 2022 · Updated 3 years ago
- Keyword spotting and forced alignment in any language ☆92 · Feb 12, 2026 · Updated 2 weeks ago
- ☆89 · May 31, 2023 · Updated 2 years ago
- Official code for Metric learning for user-defined keyword spotting ☆38 · Feb 21, 2024 · Updated 2 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769 ☆137 · Apr 29, 2022 · Updated 3 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting" ☆45 · Jan 24, 2026 · Updated last month
- Multilingual and code-switching ASR challenges for low resource Indian languages. ☆21 · Jul 26, 2021 · Updated 4 years ago
- Mining effective negative training samples for keyword spotting (PyTorch) ☆65 · May 23, 2020 · Updated 5 years ago
- Attention-based model for keyword spotting ☆19 · Aug 9, 2021 · Updated 4 years ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models. ☆32 · Nov 9, 2025 · Updated 3 months ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit ☆693 · Sep 17, 2025 · Updated 5 months ago
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021. ☆40 · Oct 11, 2022 · Updated 3 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition" ☆10 · Mar 15, 2023 · Updated 2 years ago
- Docker for building an environment for Dutch online and offline ASR. ☆12 · Feb 2, 2021 · Updated 5 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion ☆13 · Mar 15, 2025 · Updated 11 months ago
- ☆26 · Nov 3, 2025 · Updated 4 months ago
- Target speaker automatic speech recognition (TS-ASR) ☆12 · Oct 14, 2023 · Updated 2 years ago
- ☆135 · Sep 23, 2020 · Updated 5 years ago
- The case study and multilingual performance of ICASSP submission ☆24 · Sep 24, 2022 · Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Jun 16, 2022 · Updated 3 years ago
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization ☆25 · Oct 29, 2022 · Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 · May 6, 2019 · Updated 6 years ago
- Speechflow for emotion recognition related information decomposition ☆10 · Jul 27, 2021 · Updated 4 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- Simple Kaldi recipe for forced alignment ☆11 · Jul 16, 2023 · Updated 2 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq. ☆25 · Oct 3, 2022 · Updated 3 years ago