karolpiczak / ESC-50View external linksLinks
ESC-50: Dataset for Environmental Sound Classification
☆1,741Mar 20, 2024Updated last year
Alternatives and similar repositories for ESC-50
Users that are interested in ESC-50 are comparing it to the libraries listed below
Sorting:
- ESC: Dataset for Environmental Sound Classification - paper replication data☆83Dec 30, 2017Updated 8 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,419May 21, 2023Updated 2 years ago
- ☆1,662Jul 25, 2024Updated last year
- Environmental sound classification using Deep Learning with extracted features☆168Jan 22, 2020Updated 6 years ago
- EARS: Environmental Audio Recognition System☆121Apr 4, 2018Updated 7 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆470Sep 18, 2025Updated 4 months ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data☆75Sep 8, 2017Updated 8 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆392Jun 16, 2021Updated 4 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,233Dec 27, 2025Updated last month
- Repo associated to the DESED dataset, download and creation of data☆144Jul 16, 2024Updated last year
- Urban sound classification using Deep Learning☆523Sep 12, 2022Updated 3 years ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,370Jul 25, 2024Updated last year
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆697May 21, 2018Updated 7 years ago
- ☆59Apr 9, 2018Updated 7 years ago
- Convolutional neural networks for sound classification☆20Dec 30, 2017Updated 8 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,539Oct 6, 2025Updated 4 months ago
- speech enhancement\speech seperation\sound source localization☆1,223Nov 14, 2023Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,135Nov 24, 2025Updated 2 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,034Jul 5, 2023Updated 2 years ago
- A library for soundscape synthesis and augmentation☆413May 4, 2022Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆414Aug 14, 2022Updated 3 years ago
- Audio processing by using pytorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,782Feb 4, 2026Updated last week
- Learning audio concepts from natural language supervision☆640Sep 18, 2024Updated last year
- A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.☆143Oct 6, 2023Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,527Jun 13, 2025Updated 8 months ago
- DCASE 2017 Baseline system☆82Jun 26, 2020Updated 5 years ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications☆6,216Aug 4, 2025Updated 6 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,232Apr 28, 2021Updated 4 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆134Apr 3, 2025Updated 10 months ago
- ESC: Dataset for Environmental Sound Classification☆25Oct 19, 2017Updated 8 years ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆576Jul 1, 2024Updated last year
- ☆231Feb 9, 2020Updated 6 years ago
- Evaluation toolbox for Sound Event Detection☆157Jun 12, 2024Updated last year
- Efficient Training of Audio Transformers with Patchout☆371Jan 12, 2024Updated 2 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,131Jun 6, 2024Updated last year