An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
☆79Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for sklearn-audio-classification
Users that are interested in sklearn-audio-classification are comparing it to the libraries listed below.
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆266Nov 6, 2020Updated 5 years ago
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆12Feb 10, 2020Updated 6 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- Environmental sound classification using Deep Learning with extracted features☆168Jan 22, 2020Updated 6 years ago
- Urban sounds classification with Convolutional Neural Networks☆37Nov 15, 2019Updated 6 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 4 months ago
- small experimentation about positional encoding☆20Feb 9, 2020Updated 6 years ago
- A simple python script that, given a location and a date, uses the Nasa Earth API to show a photo taken by the Landsat 8 satellite. The s…☆44Apr 13, 2022Updated 3 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Jan 8, 2021Updated 5 years ago
- Code for YouTube series: Deep Learning for Audio Classification☆584Feb 6, 2023Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- Sound classification implemented with TensorFlow; blog address:☆107May 8, 2020Updated 5 years ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Oct 11, 2022Updated 3 years ago
- This is the PyNN code used in the paper titled "Multilayer Spiking Neural Network for audio samples classification using SpiNNaker", whic…☆32Dec 7, 2021Updated 4 years ago
- Toolkit to assess speech impairments in patients with neurological disorders☆59May 25, 2018Updated 7 years ago
- Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.…☆21Sep 14, 2018Updated 7 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Simple, straightforward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
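- DETR tensor: removes the auxiliary heads that are unused during inference, speeds up deployment further with fp16, and offers a new fix for the problem of all-zero outputs after conversion to TensorRT.☆10Jan 9, 2024Updated 2 years ago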
- ☆27Jan 6, 2023Updated 3 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 6 years ago
- ☆14May 28, 2020Updated 5 years ago
- compare training duration of CNN with CPU (i7 8550U) vs GPU (mx150) with CUDA depending on batch size☆12Mar 24, 2018Updated 8 years ago
- Blind Source Separation and Dereverberation☆20Mar 26, 2021Updated 5 years ago
- Course site for Methods of Statistics☆10Dec 4, 2017Updated 8 years ago
- ☆95Apr 1, 2024Updated last year
- [TMM2022] Source codes of CENet☆40Mar 14, 2023Updated 3 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Dec 30, 2017Updated 8 years ago
- Unsupervised feature learning for audio classification using convolutional deep belief networks☆12Jul 25, 2015Updated 10 years ago
- Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)☆28Jun 8, 2021Updated 4 years ago