vishalshar / Audio-Classification-using-CNN-MLP
Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to identify sound of a bee, cricket or noise.
☆66Updated 4 years ago
Alternatives and similar repositories for Audio-Classification-using-CNN-MLP:
Users that are interested in Audio-Classification-using-CNN-MLP are comparing it to the libraries listed below
- CNN 1D vs 2D audio classification☆104Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆72Updated 2 years ago
- Environmental sound classification using Deep Learning with extracted features☆164Updated 5 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆77Updated 7 years ago
- ☆21Updated 4 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆92Updated 4 years ago
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.☆66Updated 11 months ago
- Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve…☆13Updated 4 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆110Updated 2 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 4 years ago
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆37Updated 2 years ago
- Multi-class audio classification with MFCC features using CNN☆27Updated 5 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆41Updated 5 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆55Updated 5 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"☆139Updated 5 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆128Updated 7 months ago
- ☆53Updated 4 years ago
- Audio classification via transfer learning☆33Updated 5 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆125Updated 4 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- Audio feature extraction and multi-classification with the ECS-10 data set☆19Updated 6 years ago
- Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU☆27Updated 4 years ago
- Baseline of DCASE 2020 task 4☆43Updated 2 years ago
- Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM☆115Updated 5 years ago
- ☆15Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆132Updated 6 months ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆53Updated 4 years ago
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM☆103Updated 3 years ago