sahilsharma884 / Music-Genre-Classification
Performs three types of feature extraction (STFT, MFCC, and mel spectrogram) and applies CNN/VGG architectures with or without an RNN. Achieves 95% accuracy.
☆13Updated 4 years ago
Related projects
Alternatives and complementary repositories for Music-Genre-Classification
- Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU☆26Updated 4 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆21Updated 2 years ago
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM☆102Updated 3 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆50Updated last year
- In this project, the performance of speech emotion recognition is compared between two methods (SVM vs Bi-LSTM RNN). Conventional classifi…☆26Updated 10 months ago
- A PyTorch implementation of speech emotion recognition using deep 1D & 2D CNN LSTM networks☆24Updated last year
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆24Updated 3 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆64Updated 3 years ago
- Audio feature extraction and multi-classification with the ESC-10 data set☆21Updated 6 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆36Updated 8 months ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.☆13Updated 4 years ago
- Audio classification via transfer learning☆32Updated 5 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆122Updated 4 years ago
- Multi-modal Speech Emotion Recognition on IEMOCAP dataset☆84Updated last year
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆22Updated 3 months ago
- Cough detection with Log Mel Spectrogram, Wavelet Transform, Deep learning and Transfer learning techniques☆15Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆23Updated 3 years ago
- PyTorch code for our TOMM2022 paper "A Multimodal framework for large scale Emotion Recognition by Fusing Music and Electrodermal Activit…☆29Updated 2 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆66Updated 3 years ago
- Adversarial Auto-encoders for Speech Based Emotion Recognition☆14Updated 6 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆91Updated 4 years ago
- Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)☆11Updated 4 years ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Updated last year
- Reproduction of DepAudioNet by Ma et al. {DepAudioNet: An Efficient Deep Model for Audio based Depression Classification,(https://dl.acm.…☆66Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data☆123Updated 3 months ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆72Updated 2 years ago