An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below.
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆267Nov 6, 2020Updated 5 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆394Jun 16, 2021Updated 4 years ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- Environmental sound classification using Deep Learning with extracted features☆168Jan 22, 2020Updated 6 years ago
- Urban sounds classification with Convolutional Neural Networks☆37Nov 15, 2019Updated 6 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- Small experiment on positional encoding☆20Feb 9, 2020Updated 6 years ago
- Simple python algorithms for segmenting animal (songbird, mice) vocalizations into notes and syllables using Dynamic Thresholding and Con…☆27Apr 12, 2021Updated 5 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Jan 8, 2021Updated 5 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- Sound classification implemented with TensorFlow; blog address:☆107May 8, 2020Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Oct 11, 2022Updated 3 years ago
- Toolkit to assess speech impairments in patients with neurological disorders☆59May 25, 2018Updated 7 years ago
- Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.…☆21Sep 14, 2018Updated 7 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- This is the data and code for the paper: Evaluating the Efficacy of Supervised Learning vs. Large Language Models for Identifying Cogniti…☆14Aug 3, 2025Updated 8 months ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Simple, straightforward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- ☆27Jan 6, 2023Updated 3 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 7 years ago
- Java sample code showing how to integrate with TensorFlow☆13Mar 30, 2018Updated 8 years ago
- Music genre classification: LSTM vs Transformer☆63Mar 25, 2023Updated 3 years ago
- ☆15May 28, 2020Updated 5 years ago
- [TII 2022] Deep Network-Enabled Haze Visibility Enhancement for Visual IoT-Driven Intelligent Transportation Systems☆17Jul 21, 2024Updated last year
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Python-based cross-platform tool for mining text data (html, transcript, problems) of edX MOOCs on a user's dashboard. It is an extension…☆10Feb 12, 2020Updated 6 years ago
- A production-grade deep learning system for automated skin lesion classification using the HAM10000 dataset. This system provides trainin…☆13Dec 24, 2025Updated 3 months ago
- Blind Source Separation and Dereverberation☆20Mar 26, 2021Updated 5 years ago
- Course site for Methods of Statistics☆10Dec 4, 2017Updated 8 years ago
- ☆95Apr 1, 2024Updated 2 years ago
- Repository of the ISMIR'24 paper "Cue Point Estimation using Object Detection"☆28Aug 19, 2024Updated last year
- Unsupervised feature learning for audio classification using convolutional deep belief networks☆12Jul 25, 2015Updated 10 years ago
- Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)☆28Jun 8, 2021Updated 4 years ago
- Predicting various emotions in human speech signals by detecting the speech components affected by emotion.☆49Aug 2, 2024Updated last year