karolpiczak / ESC-50Links
ESC-50: Dataset for Environmental Sound Classification
☆1,638Updated last year
Alternatives and similar repositories for ESC-50
Users that are interested in ESC-50 are comparing it to the libraries listed below
Sorting:
- ☆1,555Updated last year
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,333Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,124Updated last week
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,192Updated 4 years ago
- kapre: Keras Audio Preprocessors☆929Updated last year
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,411Updated 3 years ago
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,657Updated 3 months ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆388Updated 4 years ago
- Implementation of the Wave-U-Net for audio source separation☆906Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,076Updated 7 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,447Updated last month
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,274Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,450Updated 2 months ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆863Updated 4 years ago
- Audio processing by using pytorch 1D convolution network☆1,080Updated 3 months ago
- An audio/acoustic activity detection and audio segmentation tool☆798Updated 8 months ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆451Updated 5 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆684Updated 7 years ago
- ☆687Updated 11 months ago
- speech enhancement\speech seperation\sound source localization☆1,168Updated last year
- Code for YouTube series: Deep Learning for Audio Classification☆568Updated 2 years ago
- OpenL3: Open-source deep audio and image embeddings☆537Updated 2 years ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆513Updated 3 years ago
- A must-read paper for speech separation based on neural networks☆813Updated 3 weeks ago
- A free audio dataset of spoken digits. An audio version of MNIST.☆652Updated last year
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆653Updated 3 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆429Updated last year
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆727Updated 2 years ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆544Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,000Updated 2 years ago