aizhiqi-work/MM-KWS

Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"

☆51

Alternatives and similar repositories for MM-KWS

Users who are interested in MM-KWS are comparing it to the libraries listed below.

aizhiqi-work / OpenKWS
Open-source custom wake word
☆17 · Dec 24, 2025 · Updated 7 months ago
ncsoft / PhonMatchNet
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆63 · Jun 3, 2024 · Updated 2 years ago
kaistmm / Metric-UD-KWS
Official code for Metric learning for user-defined keyword spotting
☆40 · Feb 21, 2024 · Updated 2 years ago
gusrud1103 / LibriPhrase
Recipe for LibriPhrase
☆38 · Sep 2, 2023 · Updated 2 years ago
mrusci / ondevice-learning-kws
Test Framework for few-shot open set KWS
☆45 · Nov 8, 2024 · Updated last year
DanielLin94144 / Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23 · Apr 1, 2022 · Updated 4 years ago
X-LANCE / KWStreamingSearch
☆94 · Jun 25, 2025 · Updated last year
dianwen-ng / Keyword-Spotting-ConvMixer
☆33 · Aug 10, 2022 · Updated 3 years ago
LeonWlw / asr_blockformer
E2E ASR system
☆14 · Oct 20, 2022 · Updated 3 years ago
lugan113 / SynTTS-Commands-Official
SynTTS-Commands is a large-scale, multilingual (English & Chinese) synthetic speech command dataset designed for low-power Keyword Spotti…
☆17 · Feb 5, 2026 · Updated 5 months ago
JethroWangSir / SincQDR-VAD
☆26 · Aug 29, 2025 · Updated 11 months ago
swagshaw / TorchKWS
Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.
☆41 · Apr 5, 2024 · Updated 2 years ago
audiolabs / SC-Wind-Noise-Generator
Generate synthetic wind noise signals based on a wind speed profile (Python)
☆52 · Apr 23, 2024 · Updated 2 years ago
YosukeHiguchi / espnet
End-to-End Speech Processing Toolkit
☆16 · Jan 20, 2025 · Updated last year
lingjzhu / clap-ipa
Keyword spotting and forced alignment in any language
☆100 · Jun 15, 2026 · Updated last month
Shybert-AI / AEC-Two-Stage-Based
A Two-Stage-Based Acoustic Echo Cancellation System
☆17 · Feb 22, 2026 · Updated 5 months ago
Jokejiangv / LABNet
The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…
☆49 · Oct 10, 2025 · Updated 9 months ago
wenet-e2e / wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
☆744 · Updated this week
modelscope / kws-training-suite
☆162 · May 26, 2023 · Updated 3 years ago
Dahan-Wang / Adaptive-Convolution-for-CNN-based-Speech-Enhancement-Models
☆16 · Feb 22, 2025 · Updated last year
htqin / BiFSMNv2
PyTorch implementation of BiFSMNv2, TNNLS 2023
☆37 · Feb 10, 2023 · Updated 3 years ago
wdjose / keyword-transformer
PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆16 · Jul 23, 2021 · Updated 5 years ago
harvard-edge / multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆190 · Dec 6, 2024 · Updated last year
JusperLee / S4M
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
☆28 · Feb 25, 2026 · Updated 5 months ago
Qualcomm-AI-research / bcresnet
☆100 · May 31, 2023 · Updated 3 years ago
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆32 · Mar 6, 2025 · Updated last year
wangchengzhong / GRE-Net
Official Repository for "Global Rotation Equivariant Phase Modeling for Speech Enhancement with Deep Magnitude-Phase Interaction"
☆19 · Jun 25, 2026 · Updated last month
jsvir / sparknet
[Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting
☆20 · Aug 26, 2025 · Updated 11 months ago
zycv / awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
☆290 · May 23, 2022 · Updated 4 years ago
haoxiangsnr / spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
☆142 · Jan 28, 2026 · Updated 6 months ago
ASLP-lab / Smart-Glass-Challenge
☆18 · Jun 16, 2026 · Updated last month
Tencent / StableToken
[ICLR 2026] StableToken: A state-of-the-art noise-robust semantic speech tokenizer featuring Voting-LFQ for resilient SpeechLLMs.
☆33 · Feb 27, 2026 · Updated 5 months ago
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25 · Oct 8, 2025 · Updated 9 months ago
fliu215 / UDSE_Code
☆29 · Updated this week
Kevin-naticl / LLaSE-G1
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆105 · Apr 1, 2025 · Updated last year
RoyChao19477 / PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
☆73 · May 11, 2024 · Updated 2 years ago
AaltoAcousticsLab / aalto-datasets
A list of datasets made available by members of the Aalto Acoustics Lab
☆31 · Sep 6, 2024 · Updated last year
ifnspaml / EC-Evaluation-Toolbox
Toolbox for Evaluation of AEC/AES Systems
☆39 · Feb 18, 2026 · Updated 5 months ago
ArchitParnami / Few-Shot-KWS
Few-Shot Keyword Spotting
☆73 · Apr 11, 2021 · Updated 5 years ago