AI-Research-BD / Keyword-MLP
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15 · Updated last year
Related projects:
- Streaming Audiotransformers for online Audio tagging ☆39 · Updated 3 months ago
- Test Framework for few-shot open set KWS ☆21 · Updated 2 weeks ago
- A Pytorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆31 · Updated 3 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting" ☆23 · Updated 3 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting ☆20 · Updated 3 years ago
- ☆26 · Updated last year
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification ☆14 · Updated 2 years ago
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection ☆20 · Updated 2 months ago
- ☆24 · Updated last year
- ☆51 · Updated this week
- Learning differentiable temporal resolution on time-series data. ☆33 · Updated last year
- Improving Recording Device Generalization using Impulse Response Augmentation ☆10 · Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM ☆37 · Updated last year
- ☆49 · Updated 3 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆61 · Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS" ☆23 · Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023) ☆35 · Updated 3 months ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training. ☆17 · Updated 2 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆68 · Updated last year
- experiments about AudioSet ☆43 · Updated last year
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch ☆38 · Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆40 · Updated 2 years ago
- ☆41 · Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆32 · Updated 2 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions ☆24 · Updated 2 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆20 · Updated 6 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆29 · Updated 5 months ago
- ☆16 · Updated 2 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆23 · Updated last month