felixchenfy / Speech-Commands-Classification-by-LSTM-PyTorchLinks

Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.

☆43

Alternatives and similar repositories for Speech-Commands-Classification-by-LSTM-PyTorch

Users that are interested in Speech-Commands-Classification-by-LSTM-PyTorch are comparing it to the libraries listed below

Sorting:

danFromTelAviv / key_words_spotting
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆36Updated 5 years ago
ArchitParnami / Few-Shot-KWS
Few-Shot Keyword Spotting
☆65Updated 4 years ago
roman-vygon / triplet_loss_kws
Learning Efficient Representations for Keyword Spotting with Triplet Loss
☆111Updated 2 years ago
ranchlai / speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆92Updated 3 years ago
vineeths96 / Spoken-Keyword-Spotting
In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…
☆102Updated 2 years ago
aishoot / Speech_Feature_Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
☆93Updated 4 years ago
jarfo / gcommands
Speech Commands Recognition using end-to-end deep learning models in pytorch
☆27Updated 4 years ago
desh2608 / gmm-hmm-asr
Python implementation of simple GMM and HMM models for isolated digit recognition.
☆65Updated 4 years ago
liyongze / lstm_speaker_verification
☆35Updated 6 years ago
dobby-seo / Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆108Updated 2 years ago
sonos / keyword-spotting-research-datasets
☆124Updated 4 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Updated 6 years ago
cvqluu / nn-similarity-diarization
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…
☆44Updated 4 years ago
shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆91Updated 4 years ago
staplesinLA / denoising_DIHARD18
☆60Updated 4 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆40Updated 2 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆49Updated 6 years ago
biyoml / End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
☆32Updated 3 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆60Updated 4 years ago
georgesterpu / Taris
Transformer-based online speech recognition system with TensorFlow 2
☆26Updated 4 years ago
pragyak412 / Improving-Voice-Separation-by-Incorporating-End-To-End-Speech-Recognition
Implementing the paper -
☆19Updated 2 years ago
AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23Updated 3 years ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆114Updated 2 years ago
dr-pato / audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆109Updated last year
isadrtdinov / kws-attention
Attention-based model for keywords spotting
☆19Updated 3 years ago
mechanicalsea / spectra
Spectra extraction tutorials based on torch and torchaudio.
☆41Updated last year
KimJeongSun / SpecAugment_numpy_scipy
fast SpecAugmentation code with numpy and scipy
☆31Updated 6 years ago
cvqluu / TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
☆202Updated 5 years ago
lhwcv / DTLN_pytorch
Dual-signal Transformation LSTM Network, PyTorch,NCNN
☆77Updated last year
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆61Updated 5 years ago