AI-Research-BD / Keyword-MLP
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15Updated 2 years ago
Alternatives and similar repositories for Keyword-MLP:
Users that are interested in Keyword-MLP are comparing it to the libraries listed below
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- ☆30Updated last year
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆28Updated 8 months ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.☆37Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- ☆64Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆81Updated 2 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- Test Framework for few-shot open set KWS☆29Updated 4 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆132Updated 2 years ago
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- ☆27Updated 2 years ago
- ☆43Updated 2 years ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- experiments about AudioSet☆44Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆39Updated 3 years ago