mashrurmorshed / Torch-KWT

Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.

☆37

Alternatives and similar repositories for Torch-KWT

Users who are interested in Torch-KWT are comparing it to the libraries listed below.

re9ulus / BC-ResNet
BC-ResNet for Keyword Spotting
☆39 Updated 3 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM
☆41 Updated 2 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆123 Updated 3 years ago
dobby-seo / Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2.
☆108 Updated 2 years ago
ArchitParnami / Few-Shot-KWS
Few-Shot Keyword Spotting
☆66 Updated 4 years ago
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
☆50 Updated 4 years ago
dianwen-ng / Keyword-Spotting-ConvMixer
☆33 Updated 2 years ago
roman-vygon / triplet_loss_kws
Learning Efficient Representations for Keyword Spotting with Triplet Loss
☆111 Updated 2 years ago
FFSVC / FFSVC2022_Baseline_System
☆32 Updated 2 years ago
echocatzh / torch-mfcc
A librosa-style STFT/Fbank/MFCC feature extraction written in PyTorch using 1D convolutions.
☆78 Updated 2 years ago
mayank-git-hub / ETE-Speech-Recognition
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure Python and PyTorch
☆26 Updated last year
key2miao / TSTNN
Transformer-based neural network for speech enhancement in the time domain
☆72 Updated 3 years ago
ncsoft / PhonMatchNet
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆52 Updated last year
Jasson-Chen / Add_noise_and_rir_to_speech
The purpose of this code base is to add noise from the MUSAN dataset to a clean speech signal at a specified signal-to-noise ratio and to generat…
☆29 Updated 3 years ago
Qualcomm-AI-research / bcresnet
☆66 Updated 2 years ago
jymsuper / VAD_tutorial
Simple DNN-based Voice Activity Detection (VAD) using PyTorch
☆41 Updated 5 years ago
TowerYsable / speech_enhancement_awesome
☆22 Updated 3 years ago
ranchlai / speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆93 Updated 3 years ago
ConferencingSpeech / ConferencingSpeech2021
Conferencing Speech Challenge
☆95 Updated 4 years ago
funcwj / aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
☆143 Updated 2 years ago
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆29 Updated 5 months ago
zhenghuatan / rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆136 Updated last year
ZhaZhaFon / resource_speech
A collection of resources for speech processing algorithms || NEWS: the official VoxCeleb link has recently been failing, so an external download link has been added
☆53 Updated 3 years ago
gemengtju / SpEx_Plus
SpEx+(tied) source code
☆86 Updated 2 years ago
jsvir / vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
☆32 Updated 4 months ago
seorim0 / DCCRN-with-various-loss-functions
DCCRN with various loss functions
☆96 Updated 2 years ago
daniel03c1 / NAS_VAD
☆25 Updated 9 months ago
linan2 / Voice-activity-detection-VAD-paper-and-code
Voice activity detection (VAD) papers and code (from 198*~) and their classification.
☆101 Updated last month
desh2608 / gmm-hmm-asr
Python implementation of simple GMM and HMM models for isolated digit recognition.
☆66 Updated 4 years ago
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆42 Updated 2 years ago