IliaZenkov / sklearn-audio-classificationView external linksLinks
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below
Sorting:
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 7 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆392Jun 16, 2021Updated 4 years ago
- [TII 2022] Deep Network-Enabled Haze Visibility Enhancement for Visual IoT-Driven Intelligent Transportation Systems☆17Jul 21, 2024Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- Urban sounds classification with Covnolutional Neural Networks☆37Nov 15, 2019Updated 6 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- 100 Days of GPU Challenge☆25Nov 15, 2025Updated 2 months ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Jan 8, 2021Updated 5 years ago
- Environmental sound classification using Deep Learning with extracted features☆168Jan 22, 2020Updated 6 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 2 months ago
- Graph analysis of resting state eeg data using MNE and Networkx☆19Jun 4, 2018Updated 7 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Oct 11, 2022Updated 3 years ago
- Toolkit to asses speech impairments in patients with neurological disorders☆58May 25, 2018Updated 7 years ago
- ☆93Apr 1, 2024Updated last year
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- ☆26Jan 6, 2023Updated 3 years ago
- 基于Tensorflow实现声音分类,博客地址:☆107May 8, 2020Updated 5 years ago
- Curated List of NLP tutorials☆30Feb 27, 2025Updated 11 months ago
- music genre classification : LSTM vs Transformer☆63Mar 25, 2023Updated 2 years ago
- Code for YouTube series: Deep Learning for Audio Classification☆582Feb 6, 2023Updated 3 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Apr 2, 2022Updated 3 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆31May 14, 2024Updated last year
- ☆11Aug 11, 2021Updated 4 years ago
- Synthetic Minority Over-sampling Technique, DOI: https://doi.org/10.1613/jair.953☆11May 17, 2023Updated 2 years ago
- BioAmp is an opensource project of a multichannel biopotential adquisition system for EEG, EMG, EOG and EOG signals.☆15Apr 11, 2022Updated 3 years ago
- Sora2 Watermark Remover - AI-powered video watermark removal tool using deep learning. Built with Next.js 15, ComfyUI API & advanced comp…☆24Oct 13, 2025Updated 4 months ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Jan 31, 2018Updated 8 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆36Feb 24, 2023Updated 2 years ago
- This is the PyNN code used in the paper titled "Multilayer Spiking Neural Network for audio samples classification using SpiNNaker", whic…☆32Dec 7, 2021Updated 4 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Aug 22, 2017Updated 8 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Mar 25, 2021Updated 4 years ago