KrishnaDN / Keyword-TransformerLinks

Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"

☆23

Alternatives and similar repositories for Keyword-Transformer

Users that are interested in Keyword-Transformer are comparing it to the libraries listed below

Sorting:

AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23Updated 3 years ago
isadrtdinov / kws-attention
Attention-based model for keywords spotting
☆19Updated 4 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆41Updated 2 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Updated 6 years ago
ArchitParnami / Few-Shot-KWS
Few-Shot Keyword Spotting
☆66Updated 4 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆62Updated 5 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆60Updated 4 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 7 months ago
dobby-seo / Pytorch-MHAtt-RNN-KWS
Multi-Head-Attention RNN pytorch implement for keyword spotting
☆21Updated 4 years ago
nii-yamagishilab / Attention_Backend_for_ASV
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
☆50Updated 2 years ago
WangHelin1997 / SpecAugment-plus
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆33Updated 4 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆49Updated 6 years ago
georgesterpu / Taris
Transformer-based online speech recognition system with TensorFlow 2
☆26Updated 4 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55Updated 4 years ago
funcwj / voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
☆100Updated 2 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆61Updated 4 years ago
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆24Updated 5 years ago
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74Updated 2 years ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆94Updated 2 years ago
nii-yamagishilab / Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…
☆55Updated 2 years ago
k2-fsa / multi_quantization
☆44Updated last year
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆29Updated 5 months ago
Hertin / WavPrompt
☆37Updated 3 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆55Updated 2 years ago
Jungjee / DcaseNet
Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…
☆42Updated 3 years ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58Updated last year
jindongwang / EasyEspnet
Making Espnet easier to use
☆56Updated 4 years ago
janson9192 / autokws2021
☆13Updated 4 years ago
HaoranMiao / streaming-attention
streaming attention networks for end-to-end automatic speech recognition
☆55Updated 5 years ago