karolpiczak / ESC-50
ESC-50: Dataset for Environmental Sound Classification
☆1,487Updated 11 months ago
Alternatives and similar repositories for ESC-50:
Users who are interested in ESC-50 are comparing it to the libraries listed below
- Urban sound classification using Deep Learning☆514Updated 2 years ago
- Deep learning for audio denoising☆687Updated last year
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,226Updated last year
- ☆356Updated 11 months ago
- Code for YouTube series: Deep Learning for Audio Classification☆556Updated 2 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆661Updated 6 years ago
- ☆1,416Updated 7 months ago
- kapre: Keras Audio Preprocessors☆927Updated last year
- Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…☆350Updated 2 years ago
- An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain☆647Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch☆1,594Updated 10 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,160Updated 3 years ago
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,970Updated this week
- spafe: Simplified Python Audio Features Extraction☆464Updated 8 months ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆385Updated 3 years ago
- Environmental sound classification using Deep Learning with extracted features☆165Updated 5 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,004Updated last month
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆504Updated 3 years ago
- CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Has been designed for large scale gender …☆787Updated last month
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆512Updated 8 months ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆831Updated last year
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,162Updated 7 months ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆692Updated last year
- Audio processing using a PyTorch 1D convolution network☆1,053Updated last year
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆1,849Updated 8 months ago
- A flexible source separation library in Python☆628Updated 2 months ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆918Updated 10 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆582Updated 3 years ago
- 🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆492Updated 3 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆950Updated last year