ravising-h / Urbansound8k
Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBoost, Random Forest.
☆68 Updated last year
Alternatives and similar repositories for Urbansound8k
Users who are interested in Urbansound8k are comparing it to the repositories listed below.
- A two-stage polyphonic sound event detection and localization method for both SED and DOA. ☆114 Updated 2 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id… ☆67 Updated 4 years ago
- ☆21 Updated 5 years ago
- ☆46 Updated 8 months ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆103 Updated last year
- Environmental sound classification with Convolutional neural networks and the UrbanSound8K dataset. ☆67 Updated 4 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente… ☆219 Updated 2 years ago
- Code for DCASE 2020 task 1a and task 1b. ☆86 Updated 3 years ago
- Environmental sound classification using Deep Learning with extracted features ☆165 Updated 5 years ago
- Scene Classification using Audio in the nearby Environment. ☆19 Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model. ☆74 Updated 2 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge ☆76 Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data ☆139 Updated 10 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆65 Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge ☆55 Updated 4 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow ☆63 Updated 4 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system. ☆92 Updated 4 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit… ☆127 Updated 4 years ago
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming. ☆272 Updated 3 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification" ☆128 Updated 4 years ago
- ☆53 Updated 4 years ago
- ☆299 Updated 5 years ago
- Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF ☆185 Updated 6 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn ☆143 Updated 2 years ago
- simple delaysum, MVDR and CGMM-MVDR ☆262 Updated 6 years ago
- Paderborn Sound Event Detection ☆74 Updated last year
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement. ☆330 Updated 2 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events ☆130 Updated last month
- CNN 1D vs 2D audio classification ☆104 Updated 6 years ago
- Introducing multi-channel U-Net for Music Source Separation trained using weighted multi-task loss. ☆32 Updated 2 years ago