ravising-h / Urbansound8k
Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.
☆68 Updated last year
Alternatives and similar repositories for Urbansound8k:
Users that are interested in Urbansound8k are comparing it to the libraries listed below
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id… ☆66 Updated 4 years ago
- This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆102 Updated last year
- General purpose sound recognition demo ☆156 Updated last year
- A two-stage polyphonic sound event detection and localization method for both SED and DOA. ☆112 Updated 2 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow ☆63 Updated 4 years ago
- Audio classification with VGGish as feature extractor in TensorFlow ☆128 Updated 3 years ago
- ☆43 Updated 7 months ago
- Classification of Urban Sound Audio Dataset using LSTM-based model. ☆75 Updated 2 years ago
- CNN 1D vs 2D audio classification ☆104 Updated 6 years ago
- Repo associated to the DESED dataset, download and creation of data ☆138 Updated 9 months ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp… ☆129 Updated 4 years ago
- Environmental sound classification using Deep Learning with extracted features ☆165 Updated 5 years ago
- Environmental sound classification with Convolutional neural networks and the UrbanSound8K dataset. ☆66 Updated 4 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente… ☆218 Updated 2 years ago
- ☆107 Updated 4 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w… ☆191 Updated 2 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆73 Updated 3 years ago
- ☆21 Updated 5 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn ☆143 Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆65 Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions. ☆53 Updated 5 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events ☆128 Updated 3 weeks ago
- Baseline method for sound event localization task of DCASE 2020 challenge ☆55 Updated 4 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit… ☆126 Updated 4 years ago
- Baseline systems for the FSD50K dataset ☆69 Updated 3 years ago
- Baseline of DCASE 2020 task 4 ☆43 Updated 2 years ago
- ☆53 Updated 4 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel. ☆37 Updated 3 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge ☆76 Updated 4 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network" ☆143 Updated 5 years ago