An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆269Nov 6, 2020Updated 5 years ago
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆395Jun 16, 2021Updated 4 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆12Feb 10, 2020Updated 6 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 5 months ago
- small experimentation about positional encoding☆20Feb 9, 2020Updated 6 years ago
- Simple python algorithms for segmenting animal (songbird, mice) vocalizations into notes and syllables using Dynamic Thresholding and Con…☆27Apr 12, 2021Updated 5 years ago
- A simple python script that, given a location and a date, uses the Nasa Earth API to show a photo taken by the Landsat 8 satellite. The s…☆44Apr 13, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 基于Tensorflow实现声音分类,博客地址:☆106May 8, 2020Updated 6 years ago
- A speech activity detector using HMMs☆11Feb 11, 2026Updated 2 months ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Oct 11, 2022Updated 3 years ago
- Toolkit to asses speech impairments in patients with neurological disorders☆59May 25, 2018Updated 7 years ago
- Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.…☆21Sep 14, 2018Updated 7 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- ☆27Jan 6, 2023Updated 3 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 7 years ago
- music genre classification : LSTM vs Transformer☆63Mar 25, 2023Updated 3 years ago
- [TII 2022] Deep Network-Enabled Haze Visibility Enhancement for Visual IoT-Driven Intelligent Transportation Systems☆19Jul 21, 2024Updated last year
- spaCy wrapper for JSON-NLP.☆12Aug 11, 2019Updated 6 years ago
- compare training duration of CNN with CPU (i7 8550U) vs GPU (mx150) with CUDA depending on batch size☆12Mar 24, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is the data and code for the paper: Evaluating the Efficacy of Supervised Learning vs. Large Language Models for Identifying Cogniti…☆15Aug 3, 2025Updated 9 months ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Python-based cross-platform tool for mining text data (html, transcript, problems) of edX MOOCs on a user's dashboard. It is an extension…☆10Feb 12, 2020Updated 6 years ago
- Blind Source Separation and Dereverberation☆20Mar 26, 2021Updated 5 years ago
- Course site for Methods of Statistics☆10Dec 4, 2017Updated 8 years ago
- Repository of the ISMIR'24 paper "Cue Point Estimation using Object Detection"☆29Aug 19, 2024Updated last year
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Dec 30, 2017Updated 8 years ago