thangdnsf / Audio-command-recognition
Audio command recognition by DTW and classification
☆7 Updated 4 years ago
Alternatives and similar repositories for Audio-command-recognition:
Users who are interested in Audio-command-recognition are comparing it to the repositories listed below.
- Google Speech Command Dataset Classification Neural Network, CNN, RNN ☆24 Updated 7 years ago
- Audio data augmentation examples ☆34 Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 Updated last year
- Keyword Spotting for detecting a word in an audio file ☆17 Updated 5 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset. ☆25 Updated 6 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 Updated 2 years ago
- Implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING" ☆36 Updated 5 years ago
- ☆23 Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆43 Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆39 Updated 3 years ago
- A program to generate microphone wind noise audio. Ideal for generating example data for designing noise removal algorithms. ☆17 Updated 6 years ago
- Sound event detection with depthwise separable and dilated convolutions. ☆53 Updated 4 years ago
- A PyTorch implementation of FFTNet. ☆36 Updated 6 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆29 Updated last year
- Extract frequency, power, width and dissonance of formants from wav files ☆25 Updated 2 years ago
- A temporal module for PyTorch-ComplexTensor ☆45 Updated 7 months ago
- Speaker diarization done on a toy dataset and tested on the TIMIT dataset ☆8 Updated 3 years ago
- End-to-end diarization loss ☆22 Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 Updated 2 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ TensorFlow ☆63 Updated 4 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Updated last year
- Classifying 10 different categories of Sound using Deep Learning. ☆25 Updated 6 years ago
- ☆15 Updated 5 years ago
- Phoneme prediction from speech mel-spectrograms using RNN. ☆13 Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition. ☆36 Updated 5 years ago
- TensorFlow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution" (INTERSPEECH 2020) ☆33 Updated 4 years ago
- Audio Keyword Search ☆12 Updated 5 years ago
- Python 3.5 and Windows version of Speech Enhancement using DNN by Yong Xu and Qiuqiang Kong ☆15 Updated 5 years ago
- VoxSRC Challenge ☆31 Updated 5 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement ☆24 Updated 2 years ago