An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below
Sorting:
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆264Nov 6, 2020Updated 5 years ago
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated last year
- A simple python script that, given a location and a date, uses the Nasa Earth API to show a photo taken by the Landsat 8 satellite. The s…☆44Apr 13, 2022Updated 3 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆12Feb 10, 2020Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- Urban sounds classification with Covnolutional Neural Networks☆37Nov 15, 2019Updated 6 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- 100 Days of GPU Challenge☆25Nov 15, 2025Updated 3 months ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Jan 8, 2021Updated 5 years ago
- Environmental sound classification using Deep Learning with extracted features☆168Jan 22, 2020Updated 6 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- Graph analysis of resting state eeg data using MNE and Networkx☆20Jun 4, 2018Updated 7 years ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 3 months ago
- 基于CNN的音频识别☆17Feb 13, 2019Updated 7 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- small experimentation about positional encoding☆19Feb 9, 2020Updated 6 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- 基于Tensorflow实现声音分类,博客地址:☆107May 8, 2020Updated 5 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- music genre classification : LSTM vs Transformer☆63Mar 25, 2023Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- Simple python algorithms for segmenting animal (songbird, mice) vocalizations into notes and syllables using Dynamic Thresholding and Con…☆27Apr 12, 2021Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Apr 2, 2022Updated 3 years ago
- Highly precise and universal compass and sensor-based solar tracker control with minimal hardware and full astronomic tracking for Concen…☆10Sep 4, 2022Updated 3 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆31May 14, 2024Updated last year
- ☆11Aug 11, 2021Updated 4 years ago
- BioAmp is an opensource project of a multichannel biopotential adquisition system for EEG, EMG, EOG and EOG signals.☆16Apr 11, 2022Updated 3 years ago
- Official Spring AI support for latest watsonx.ai services☆22Feb 24, 2026Updated last week
- Synthetic Minority Over-sampling Technique, DOI: https://doi.org/10.1613/jair.953☆11May 17, 2023Updated 2 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆32Dec 4, 2020Updated 5 years ago
- ⚡ Generate channel maps for Neuropixels probes that respect electrode-ADC wiring constraints! ⚡☆30Feb 21, 2026Updated 2 weeks ago
- This is the PyNN code used in the paper titled "Multilayer Spiking Neural Network for audio samples classification using SpiNNaker", whic…☆32Dec 7, 2021Updated 4 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Jan 31, 2018Updated 8 years ago
- Multimodal Transformers for biomedical text and Knowledge Graph data☆34Mar 3, 2023Updated 3 years ago