An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆269Nov 6, 2020Updated 5 years ago
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆391Jun 16, 2021Updated 5 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆12Feb 10, 2020Updated 6 years ago
- Urban sounds classification with Covnolutional Neural Networks☆37Nov 15, 2019Updated 6 years ago
- 基于CNN的音频识别☆18Feb 13, 2019Updated 7 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- Graph analysis of resting state eeg data using MNE and Networkx☆20Jun 4, 2018Updated 8 years ago
- small experimentation about positional encoding☆20Feb 9, 2020Updated 6 years ago
- Simple python algorithms for segmenting animal (songbird, mice) vocalizations into notes and syllables using Dynamic Thresholding and Con…☆27Apr 12, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆68Jan 8, 2021Updated 5 years ago
- Code for YouTube series: Deep Learning for Audio Classification☆588Feb 6, 2023Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- 基于Tensorflow实现声音分类,博客地址:☆107May 8, 2020Updated 6 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Oct 11, 2022Updated 3 years ago
- This is the PyNN code used in the paper titled "Multilayer Spiking Neural Network for audio samples classification using SpiNNaker", whic…☆32Dec 7, 2021Updated 4 years ago
- Toolkit to asses speech impairments in patients with neurological disorders☆60May 25, 2018Updated 8 years ago
- Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.…☆22Sep 14, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 7 years ago
- Java sample codes on how to integrate with tensorflow☆13Mar 30, 2018Updated 8 years ago
- music genre classification : LSTM vs Transformer☆62Mar 25, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [TII 2022] Deep Network-Enabled Haze Visibility Enhancement for Visual IoT-Driven Intelligent Transportation Systems☆19Jul 21, 2024Updated last year
- ☆15May 28, 2020Updated 6 years ago
- spaCy wrapper for JSON-NLP.☆12Aug 11, 2019Updated 6 years ago
- compare training duration of CNN with CPU (i7 8550U) vs GPU (mx150) with CUDA depending on batch size☆12Mar 24, 2018Updated 8 years ago
- This is the data and code for the paper: Evaluating the Efficacy of Supervised Learning vs. Large Language Models for Identifying Cogniti…☆16Aug 3, 2025Updated 10 months ago
- 100 Days of GPU Challenge☆26Nov 15, 2025Updated 7 months ago
- Python-based cross-platform tool for mining text data (html, transcript, problems) of edX MOOCs on a user's dashboard. It is an extension…☆10Feb 12, 2020Updated 6 years ago