KrishnaDN / Keyword-Transformer
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Keyword-Transformer
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- ☆13Updated 3 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆35Updated 4 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆46Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆19Updated last year
- Mining effective negative training samples for keyword spotting (PyTorch)☆57Updated 4 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- ☆36Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 4 months ago
- ☆53Updated 3 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆25Updated 4 months ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆43Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago