wdjose / keyword-transformer
PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆14Updated 3 years ago
Related projects: ⓘ
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆20Updated 3 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆46Updated 3 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 2 years ago
- ☆26Updated last year
- Improving Recording Device Generalization using Impulse Response Augmentation☆10Updated last year
- ☆21Updated 3 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Updated 3 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- The code for DCASE2021 task5 submission.☆20Updated 2 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆16Updated 2 years ago
- ☆18Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆33Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated last year
- ☆13Updated this week
- ☆29Updated 2 years ago
- experiments about AudioSet☆43Updated last year
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆68Updated last year
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- ☆61Updated last week
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆27Updated last year
- ☆22Updated 2 years ago
- ☆28Updated 2 years ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆40Updated 4 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- ☆51Updated this week
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago