AI-Research-BD / Keyword-MLP
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15Updated 2 years ago
Alternatives and similar repositories for Keyword-MLP:
Users who are interested in Keyword-MLP are comparing it to the libraries listed below
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆65Updated 2 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 7 months ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- ☆30Updated last year
- An STFT/iSTFT written in PyTorch using 1D convolutions☆27Updated 6 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Test Framework for few-shot open set KWS☆25Updated 2 months ago
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer☆34Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆43Updated 7 months ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"☆31Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 5 months ago
- ☆63Updated 4 months ago
- Multi-head attention RNN PyTorch implementation for keyword spotting☆21Updated 4 years ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" in PyTorch☆28Updated 3 years ago
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆40Updated last year
- PyTorch implementation of "Integrated Parameter-Efficient Tuning for General-Purpose Audio Models"☆10Updated last year
- ☆64Updated last year
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆17Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆35Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆22Updated 10 months ago
- ☆26Updated last year
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- ☆25Updated 3 years ago
- ☆31Updated 2 years ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆38Updated 2 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆35Updated last year