aminul-huq / Speech-Command-Classification
Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.
☆9Updated 2 years ago
Alternatives and similar repositories for Speech-Command-Classification:
Users that are interested in Speech-Command-Classification are comparing it to the libraries listed below
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆32Updated 7 months ago
- Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)☆68Updated 4 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆25Updated 6 months ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆55Updated 3 years ago
- ☆49Updated 2 years ago
- Official repository of our paper: https://arxiv.org/abs/2010.15366☆62Updated 3 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆21Updated 2 years ago
- MSP-Podcast Challenge Baseline Code☆20Updated 8 months ago
- ☆41Updated 4 years ago
- BC-ResNet for Keyword Spotting☆35Updated 3 years ago
- ☆21Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆89Updated 3 years ago
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment…☆42Updated 2 years ago
- ☆17Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- Matlab tools for pathological voice analysis☆12Updated last year
- Paderborn Sound Event Detection☆72Updated last year
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Updated last year
- Wave U Net (NNabla)☆11Updated 4 years ago
- 语音增强☆15Updated 3 years ago
- PyTorch implementation of the LEAF audio frontend☆69Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 6 months ago
- e2e_antispoofing☆19Updated 3 years ago
- This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).☆29Updated last month
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆72Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆55Updated 5 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆48Updated 2 years ago
- ☆18Updated 2 years ago
- Submission to the HEAR2021 Challenge☆15Updated 2 years ago