wdjose / keyword-transformer
PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆15Updated 3 years ago
Alternatives and similar repositories for keyword-transformer:
Users that are interested in keyword-transformer are comparing it to the libraries listed below
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- TDY-CNN for text-independent speaker verification☆17Updated 2 years ago
- ☆63Updated 5 months ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Updated 3 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆24Updated 2 years ago
- acnn for text-independent speaker recognition☆9Updated 3 years ago
- The code for DCASE2021 task5 submission.☆20Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- Few-Shot Keyword Spotting☆63Updated 3 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆43Updated 4 years ago
- ☆18Updated 2 years ago
- ☆21Updated 3 years ago
- ☆30Updated last year
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆67Updated 3 years ago
- ☆31Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- MultiSV: scripts for data preparation☆27Updated last month
- ☆32Updated 2 years ago
- ☆22Updated 3 months ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 5 years ago
- ☆55Updated last year
- ☆49Updated 2 years ago