IliaZenkov / sklearn-audio-classification
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆71Updated 3 years ago
Related projects: ⓘ
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆20Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆38Updated 2 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆122Updated 2 years ago
- ☆84Updated last year
- Time series course Fall 2019 project☆52Updated 4 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆134Updated 3 years ago
- Pytorch implementation of deep audio embedding calculation☆93Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆88Updated 3 years ago
- ☆128Updated 3 weeks ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆97Updated last year
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆64Updated 3 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 2 years ago
- ☆26Updated 2 years ago
- Matlab tools for pathological voice analysis☆11Updated last year
- music genre classification : LSTM vs Transformer☆61Updated last year
- ☆99Updated 4 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆22Updated 3 years ago
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆36Updated last year
- Repo associated to the DESED dataset, download and creation of data☆121Updated 2 months ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆69Updated 4 years ago
- ☆51Updated 6 years ago
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.☆65Updated 7 months ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆43Updated 3 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆20Updated 2 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆91Updated 4 years ago
- open-source audio datasets☆141Updated last year
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 3 years ago
- Classifying Audio to Emotion☆27Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆164Updated 6 months ago