dobby-seo / Pytorch-MHAtt-RNN-KWSLinks

Multi-Head-Attention RNN pytorch implement for keyword spotting

☆21

Alternatives and similar repositories for Pytorch-MHAtt-RNN-KWS

Users that are interested in Pytorch-MHAtt-RNN-KWS are comparing it to the libraries listed below

Sorting:

YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆41Updated 2 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆62Updated 5 years ago
roman-vygon / BCResNet
Broadcasted Residual Learning for Efficient Keyword Spotting
☆23Updated 4 years ago
Qualcomm-AI-research / bcresnet
☆66Updated 2 years ago
mrusci / ondevice-learning-kws
Test Framework for few-shot open set KWS
☆32Updated 8 months ago
danFromTelAviv / key_words_spotting
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆36Updated 5 years ago
ArchitParnami / Few-Shot-KWS
Few-Shot Keyword Spotting
☆66Updated 4 years ago
isadrtdinov / kws-attention
Attention-based model for keywords spotting
☆19Updated 3 years ago
Interlagos / TENet-kws
Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution"(INTERSPEECH 2020)
☆33Updated 4 years ago
KrishnaDN / Keyword-Transformer
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23Updated 4 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆41Updated 2 years ago
dianwen-ng / Keyword-Spotting-ConvMixer
☆33Updated 2 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 7 months ago
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆29Updated 5 months ago
mispchallenge / misp2022_baseline
☆30Updated 2 years ago
ncsoft / PhonMatchNet
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆51Updated last year
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆15Updated 2 years ago
jsvir / vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
☆32Updated 4 months ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58Updated last year
yqcai888 / DCASE2023
2022 DCASE Challenge
☆12Updated 10 months ago
htqin / BiFSMN
Pytorch implementation of BiFSMN, IJCAI 2022
☆21Updated 2 years ago
k2-fsa / multi_quantization
☆44Updated last year
janson9192 / autokws2021
☆13Updated 4 years ago
NaoyukiKanda / LibriSpeechMix
☆36Updated 4 years ago
WangHelin1997 / SpecAugment-plus
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Updated 4 years ago
skgusrb12 / voice_activity_detection
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26Updated 4 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Updated 6 years ago
nii-yamagishilab / NELE-GAN
Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
☆22Updated 3 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆49Updated 6 years ago