mashrurmorshed / Torch-KWT
Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.
☆36Updated 2 years ago
Alternatives and similar repositories for Torch-KWT:
Users that are interested in Torch-KWT are comparing it to the libraries listed below
- ☆50Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆25Updated 6 months ago
- ☆55Updated last year
- ☆44Updated 4 years ago
- ☆31Updated 2 years ago
- Few-Shot Keyword Spotting☆63Updated 3 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆28Updated last month
- ☆32Updated 3 years ago
- ☆49Updated 2 years ago
- Conferencing Speech Challenge☆90Updated 3 years ago
- SpEx+(tied) source code☆77Updated last year
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆39Updated 5 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆21Updated 2 months ago
- speech-enhacement☆50Updated 5 years ago
- ☆58Updated 3 years ago
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆24Updated 7 months ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆37Updated 7 months ago
- DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping met…☆53Updated 2 years ago
- ☆41Updated 5 years ago
- Beam-guided TasNet☆49Updated 2 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Updated 3 years ago
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"☆19Updated 5 years ago
- ☆90Updated 3 years ago
- ☆32Updated 2 years ago
- A simple package for Guided source separation (GSS)☆114Updated 9 months ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆38Updated last year
- ☆29Updated 2 years ago