aminul-huq / Speech-Command-ClassificationLinks
Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.
☆9Updated 2 years ago
Alternatives and similar repositories for Speech-Command-Classification
Users that are interested in Speech-Command-Classification are comparing it to the libraries listed below
Sorting:
- A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.☆20Updated last week
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆73Updated 3 years ago
- ☆107Updated 5 years ago
- Room impulse response simulator using python☆97Updated 5 years ago
- BC-ResNet for Keyword Spotting☆39Updated 3 years ago
- Clarity Challenge toolkit - software for building Clarity Challenge systems☆158Updated last week
- transformer based neural network for speech enhancement in time domain☆71Updated 3 years ago
- Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"☆125Updated 10 months ago
- Paderborn Sound Event Detection☆74Updated 2 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆204Updated 2 years ago
- ☆56Updated last year
- Official repository of our paper: https://arxiv.org/abs/2010.15366☆63Updated 3 years ago
- ☆196Updated last year
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆92Updated 3 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆53Updated 5 months ago
- ☆37Updated last year
- Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch☆105Updated 4 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆23Updated 5 years ago
- ☆36Updated 3 years ago
- EVAR ~ Evaluation package for Audio Representations☆60Updated 3 weeks ago
- Python loaders for many Real Room Impulse Response databases☆91Updated 9 months ago
- TDY-CNN for text-independent speaker verification☆18Updated 2 years ago
- Phase-aware speech enchancement with Deep Complex U-Net☆118Updated 2 years ago
- ☆20Updated 4 years ago
- Machine and Deep Learning models for speech dereverberation☆116Updated 3 years ago
- Source code for LCN submission for ConferencingSpeech2022 challenge.☆14Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆195Updated 2 years ago
- ☆52Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data☆138Updated last year