karolpiczak / ESC-50
ESC-50: Dataset for Environmental Sound Classification
☆1,713Updated last year
Alternatives and similar repositories for ESC-50
Users who are interested in ESC-50 are comparing it to the libraries listed below
- ☆1,648Updated last year
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,207Updated 2 weeks ago
- kapre: Keras Audio Preprocessors☆938Updated 2 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,407Updated 2 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,517Updated 3 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,219Updated 4 years ago
- Implementation of the Wave-U-Net for audio source separation☆927Updated 2 years ago
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,754Updated 3 weeks ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆392Updated 4 years ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆937Updated last year
- Urban sound classification using Deep Learning☆523Updated 3 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,124Updated last month
- CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Has been designed for large scale gender …☆859Updated 3 months ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,423Updated 4 years ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,356Updated last year
- Audio processing using PyTorch 1D convolution networks☆1,112Updated last month
- Code for YouTube series: Deep Learning for Audio Classification☆579Updated 2 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆866Updated 4 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,112Updated last year
- A free audio dataset of spoken digits. An audio version of MNIST.☆664Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,807Updated this week
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,510Updated 6 months ago
- An audio/acoustic activity detection and audio segmentation tool☆827Updated last year
- A flexible source separation library in Python☆643Updated last year
- The Munich Open-Source Large-Scale Multimedia Feature Extractor☆753Updated 2 months ago
- An implementation of SpecAugment, introduced by Google Brain, with TensorFlow & PyTorch☆656Updated 3 years ago
- ☆699Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,421Updated last year
- Voice Activity Detector in Python☆480Updated 5 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆697Updated 7 years ago