russellgeum / Temporal-Convolution-ResnetLinks
[Not Official] Implementation of TC-Resnet, INTERSPEECH 2019
☆21Updated last year
Alternatives and similar repositories for Temporal-Convolution-Resnet
Users that are interested in Temporal-Convolution-Resnet are comparing it to the libraries listed below
Sorting:
- ☆71Updated 2 years ago
- Test Framework for few-shot open set KWS☆33Updated 10 months ago
- ☆13Updated 4 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆62Updated 5 years ago
- Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution"(INTERSPEECH 2020)☆33Updated 4 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆36Updated 4 months ago
- Went online decode demo☆31Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 2 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆132Updated 3 years ago
- Recipe for LibriPhrase☆31Updated 2 years ago
- ☆32Updated 3 years ago
- ☆33Updated 3 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆55Updated last year
- Few-Shot Keyword Spotting☆66Updated 4 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆28Updated last year
- Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM☆49Updated 3 years ago
- Official code for Metric learning for user-defined keyword spotting☆34Updated last year
- BC-ResNet for Keyword Spotting☆39Updated 3 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆35Updated 5 months ago
- ☆50Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆60Updated 2 years ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆46Updated 2 years ago
- acnn for text-independent speaker recognition☆10Updated 3 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆110Updated 3 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Updated 5 years ago
- Pytorch Models for Speech Enhancement☆22Updated 2 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago