thangdnsf / Audio-command-recognition

Audio command recognition by DTW and classification

☆7

Alternatives and similar repositories for Audio-command-recognition

Users who are interested in Audio-command-recognition are comparing it to the libraries listed below.

ankitshah009 / WALNet-Weak_Label_Analysis
Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.
☆32 Updated last year
wikke / AudioRecognition
Classification of the Google Speech Command Dataset with neural networks (CNN, RNN)
☆25 Updated 7 years ago
aishoot / Speech_Feature_Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
☆93 Updated 4 years ago
joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆30 Updated last year
tabahi / formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
☆26 Updated 3 years ago
sarthak268 / Audio_Classification_using_LSTM
Classification of the Urban Sound audio dataset using an LSTM-based model.
☆74 Updated 2 years ago
joaoantoniocn / AM-SincNet
The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…
☆45 Updated last year
mechanicalsea / spectra
Spectra extraction tutorials based on torch and torchaudio.
☆41 Updated last year
alibugra / audio-data-augmentation
Audio data augmentation examples
☆34 Updated 7 years ago
MLSpeech / speech_yolo
SpeechYOLO Interspeech 2019
☆44 Updated 2 years ago
matthijsvk / multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
☆69 Updated 2 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
☆36 Updated 5 years ago
giusenso / seld-tcn
SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow
☆64 Updated 4 years ago
anuragkr90 / weak_feature_extractor
☆59 Updated 7 years ago
vbelz / audio_classification
Audio classification via transfer learning
☆33 Updated 5 years ago
YashNita / sound-event-detection-winning-method
☆25 Updated 6 years ago
shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆91 Updated 4 years ago
vishalshar / Audio-Classification-using-CNN-MLP
Multi-class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi-class classifier to id…
☆67 Updated 4 years ago
jim-schwoebel / audioset_models
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
☆30 Updated last year
mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec…
☆38 Updated 7 years ago
swainshashwat / Audio-Classification-using-Deep-Learning
Classifying 10 different categories of sound using Deep Learning.
☆25 Updated 6 years ago
DeepLearn-lab / Acoustic-Feature-Fusion_Chime18
Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and TensorFlow
☆26 Updated 6 years ago
dr-costas / SEDLM
Language modelling for sound event detection
☆20 Updated 5 years ago
CVxTz / audio_classification
CNN 1D vs 2D audio classification
☆104 Updated 6 years ago
dr-costas / dnd-sed
Sound event detection with depthwise separable and dilated convolutions.
☆53 Updated 5 years ago
SIP-Lab / CNN-VAD
A Convolutional Neural Network based Voice Activity Detector for Smartphones
☆71 Updated 6 years ago
georgesterpu / Taris
Transformer-based online speech recognition system with TensorFlow 2
☆26 Updated 4 years ago
danFromTelAviv / key_words_spotting
Implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆36 Updated 5 years ago
doerlbh / MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
☆27 Updated 3 years ago
nipunmanral / Spoken-Language-Identification
Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features
☆26 Updated 11 months ago