mrusci / ondevice-learning-kws
Test Framework for few-shot open set KWS
☆25Updated 3 months ago
Alternatives and similar repositories for ondevice-learning-kws:
Users that are interested in ondevice-learning-kws are comparing it to the libraries listed below
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆43Updated 8 months ago
- ☆31Updated 2 years ago
- ☆55Updated last year
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆24Updated 7 months ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆21Updated 2 months ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆23Updated 10 months ago
- Official code for Metric learning for user-defined keyword spotting☆28Updated 11 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- ☆30Updated last year
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- ☆26Updated last year
- Official repository of NeXt-TDNN for speaker verification☆65Updated 4 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆32Updated 6 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆36Updated 2 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆22Updated last month
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆29Updated 4 months ago
- ☆14Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 6 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated last year
- Pytorch implementation of BiFSMN, IJCAI 2022☆21Updated 2 years ago
- ☆43Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆12Updated last year
- Recipe for LibriPhrase☆27Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆21Updated 2 months ago
- Learning differentiable temporal resolution on time-series data.☆35Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 8 months ago
- ☆41Updated 9 months ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆45Updated 3 weeks ago