felixchenfy / Speech-Commands-Classification-by-LSTM-PyTorchLinks
Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
☆43Updated 2 years ago
Alternatives and similar repositories for Speech-Commands-Classification-by-LSTM-PyTorch
Users that are interested in Speech-Commands-Classification-by-LSTM-PyTorch are comparing it to the libraries listed below
Sorting:
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆91Updated 3 years ago
- ☆63Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆25Updated 10 months ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆45Updated 4 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆114Updated 2 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆25Updated last year
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Test Framework for few-shot open set KWS☆31Updated 6 months ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated last year
- an Audio-Visual Voice Activity Detection using Deep Learning☆49Updated 6 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆61Updated 5 years ago
- ☆21Updated 5 years ago
- Few-Shot Keyword Spotting☆64Updated 4 years ago
- ☆15Updated 4 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆102Updated 6 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM☆47Updated 3 years ago
- Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution"(INTERSPEECH 2020)☆34Updated 4 years ago
- Python implementation of simple GMM and HMM models for isolated digit recognition.☆65Updated 4 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆111Updated 2 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆99Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 5 years ago
- Code for DCASE 2020 task 1a and task 1b.☆86Updated 3 years ago