An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below.
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆269Nov 6, 2020Updated 5 years ago
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆392Jun 16, 2021Updated 4 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆12Feb 10, 2020Updated 6 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆19Dec 20, 2023Updated 2 years ago
- Urban sounds classification with Convolutional Neural Networks☆37Nov 15, 2019Updated 6 years ago
- CNN-based audio recognition☆18Feb 13, 2019Updated 7 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 6 months ago
- small experimentation about positional encoding☆20Feb 9, 2020Updated 6 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆68Jan 8, 2021Updated 5 years ago
- Code for YouTube series: Deep Learning for Audio Classification☆588Feb 6, 2023Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- A speech activity detector using HMMs☆11Feb 11, 2026Updated 3 months ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- This is the PyNN code used in the paper titled "Multilayer Spiking Neural Network for audio samples classification using SpiNNaker", whic…☆32Dec 7, 2021Updated 4 years ago
- Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.…☆22Sep 14, 2018Updated 7 years ago
- Implementation of SampleRNN for generating novel ambient music from raw audio source material☆10Jul 25, 2024Updated last year
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 7 years ago
- Java sample code showing how to integrate with TensorFlow☆13Mar 30, 2018Updated 8 years ago
- music genre classification : LSTM vs Transformer☆62Mar 25, 2023Updated 3 years ago
- [TII 2022] Deep Network-Enabled Haze Visibility Enhancement for Visual IoT-Driven Intelligent Transportation Systems☆19Jul 21, 2024Updated last year
- compare training duration of CNN with CPU (i7 8550U) vs GPU (mx150) with CUDA depending on batch size☆12Mar 24, 2018Updated 8 years ago
- spaCy wrapper for JSON-NLP.☆12Aug 11, 2019Updated 6 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- 100 Days of GPU Challenge☆26Nov 15, 2025Updated 6 months ago
- Python-based cross-platform tool for mining text data (html, transcript, problems) of edX MOOCs on a user's dashboard. It is an extension…☆10Feb 12, 2020Updated 6 years ago
- Blind Source Separation and Dereverberation☆21Mar 26, 2021Updated 5 years ago