kaistmm / Metric-UD-KWS
Official code for Metric learning for user-defined keyword spotting
☆22Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for Metric-UD-KWS
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆23Updated 4 months ago
- ☆46Updated last year
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 3 years ago
- Test Framework for few-shot open set KWS☆24Updated 2 weeks ago
- ☆26Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆36Updated 5 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- ☆18Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆10Updated last year
- Learning differentiable temporal resolution on time-series data.☆32Updated last year
- Streaming Audiotransformers for online Audio tagging☆41Updated 4 months ago
- ☆31Updated 2 years ago
- ☆62Updated last month
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 2 months ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆20Updated 7 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- ☆22Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated last year
- ☆21Updated 2 weeks ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆63Updated 2 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆64Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆69Updated 2 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆23Updated 2 years ago
- ☆25Updated last week
- Recipe for LibriPhrase☆23Updated last year