ravising-h / Urbansound8kLinks
Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.
β73Updated last year
Alternatives and similar repositories for Urbansound8k
Users that are interested in Urbansound8k are comparing it to the libraries listed below
Sorting:
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β104Updated 2 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to idβ¦β69Updated 4 years ago
- General purpose sound recognition demoβ159Updated 2 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implementeβ¦β222Updated 2 years ago
- Environmental sound classification using Deep Learning with extracted featuresβ168Updated 5 years ago
- β47Updated last year
- Feature extraction of speech signal is the initial stage of any speech recognition system.β96Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.β76Updated 3 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.β119Updated 2 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Eventsβ134Updated 8 months ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"β129Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of dataβ142Updated last year
- β101Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".β149Updated 2 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEEβ¦β205Updated 2 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflowβ66Updated 5 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: spβ¦β131Updated 5 years ago
- Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a β¦β253Updated 2 years ago
- Improved Wave-U-Net implemented in Pytorchβ359Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wiβ¦β92Updated 4 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learnβ145Updated 3 years ago
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.β344Updated 3 years ago
- Visualization toolbox for Sound Event Detectionβ123Updated last year
- Audio classification with VGGish as feature extractor in TensorFlowβ131Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2020 challengeβ56Updated 5 years ago
- CNN 1D vs 2D audio classificationβ106Updated 6 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorcβ¦β340Updated 5 years ago
- Audio transformations library for PyTorchβ233Updated 3 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.β65Updated last year
- Paderborn Sound Event Detectionβ76Updated 2 years ago