KrishnaDN / Keyword-Transformer
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23Updated 3 years ago
Related projects: ⓘ
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆20Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 3 years ago
- ☆51Updated this week
- ☆13Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆37Updated last year
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆19Updated last year
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆35Updated 4 years ago
- End-to-end diarization loss☆19Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆9Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- ☆35Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆25Updated 3 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆12Updated 3 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆55Updated 4 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆29Updated 5 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆26Updated 5 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆43Updated last year
- ☆63Updated this week
- ☆20Updated 3 years ago
- ☆28Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆14Updated 3 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- ☆22Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago