Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆48 · Jan 24, 2026 · Updated 2 months ago
Alternatives and similar repositories for MM-KWS
Users that are interested in MM-KWS are comparing it to the libraries listed below.
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023) ☆59 · Jun 3, 2024 · Updated last year
- Official code for Metric learning for user-defined keyword spotting ☆38 · Feb 21, 2024 · Updated 2 years ago
- Recipe for LibriPhrase ☆35 · Sep 2, 2023 · Updated 2 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-… ☆21 · Apr 1, 2022 · Updated 3 years ago
- Test Framework for few-shot open set KWS ☆42 · Nov 8, 2024 · Updated last year
- ☆32 · Aug 10, 2022 · Updated 3 years ago
- E2E ASR system ☆14 · Oct 20, 2022 · Updated 3 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers. ☆37 · Apr 5, 2024 · Updated last year
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting ☆17 · Aug 26, 2025 · Updated 7 months ago
- End-to-End Speech Processing Toolkit ☆15 · Jan 20, 2025 · Updated last year
- PyTorch implementation of BiFSMNv2, TNNLS 2023 ☆35 · Feb 10, 2023 · Updated 3 years ago
- PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting" ☆16 · Jul 23, 2021 · Updated 4 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆186 · Dec 6, 2024 · Updated last year
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models ☆26 · Feb 25, 2026 · Updated last month
- ☆21 · Aug 25, 2025 · Updated 7 months ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw… ☆31 · Mar 6, 2025 · Updated last year
- Production First and Production Ready End-to-End Keyword Spotting Toolkit ☆703 · Sep 17, 2025 · Updated 6 months ago
- ☆91 · May 31, 2023 · Updated 2 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection). ☆283 · May 23, 2022 · Updated 3 years ago
- Few-Shot Keyword Spotting ☆71 · Apr 11, 2021 · Updated 4 years ago
- ☆25 · Aug 29, 2025 · Updated 7 months ago
- Official repository of Fast-ULCNet. ☆28 · Feb 4, 2026 · Updated last month
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022) ☆16 · Dec 1, 2022 · Updated 3 years ago
- PrimeK-Net official code ☆27 · Mar 5, 2025 · Updated last year
- Zero-Shot Blind Audio Bandwidth Extension ☆27 · May 25, 2023 · Updated 2 years ago
- [Not Official] Implementation of TC-Resnet, INTERSPEECH 2019 ☆22 · Jan 24, 2024 · Updated 2 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- ☆11 · Oct 24, 2022 · Updated 3 years ago
- Official code for Dense-TSNet ☆12 · Sep 17, 2024 · Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆22 · May 26, 2025 · Updated 10 months ago
- A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors ☆25 · Jul 30, 2025 · Updated 7 months ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022) ☆73 · May 11, 2024 · Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆38 · Aug 7, 2024 · Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 9 months ago
- Keyword spotting and forced alignment in any language ☆92 · Feb 12, 2026 · Updated last month
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval ☆16 · Sep 21, 2023 · Updated 2 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks ☆18 · May 12, 2025 · Updated 10 months ago
- PyTorch based toolkit for developing spiking neural networks (SNNs) by training and testing them on speech command recognition tasks ☆30 · May 3, 2024 · Updated last year
- DPDFNet: causal single-channel speech enhancement that boosts DeepFilterNet2 with dual-path RNN blocks for stronger long-range temporal a… ☆52 · Mar 20, 2026 · Updated last week