aminul-huq / Speech-Command-Classification
Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.
☆9Updated 2 years ago
Alternatives and similar repositories for Speech-Command-Classification:
Users that are interested in Speech-Command-Classification are comparing it to the libraries listed below
- MSP-Podcast Challenge Baseline Code☆21Updated 10 months ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆14Updated 3 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆23Updated 4 months ago
- Multilingual datasets with raw audio for speech emotion recognition☆25Updated 3 years ago
- ☆29Updated 2 years ago
- ☆49Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆38Updated 9 months ago
- ☆11Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- BC-ResNet for Keyword Spotting☆38Updated 3 years ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Updated last year
- ☆13Updated 2 years ago
- ☆31Updated 2 years ago
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment…☆42Updated 2 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆71Updated 3 years ago
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.☆37Updated 2 years ago
- ☆41Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆131Updated 3 months ago
- ☆108Updated 2 years ago
- ☆53Updated 4 years ago
- Text to Speech with PyTorch (English and Mongolian)☆12Updated 4 years ago
- A unified dataset of multilingual emotional human utterances☆25Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 2 years ago
- ☆21Updated 3 years ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆39Updated 9 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆20Updated 7 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- ☆60Updated last year
- e2e_antispoofing☆20Updated 3 years ago