karolpiczak / ESC-50
ESC-50: Dataset for Environmental Sound Classification
☆1,356Updated 5 months ago
Related projects: ⓘ
- Urban sound classification using Deep Learning☆512Updated 2 years ago
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,812Updated this week
- kapre: Keras Audio Preprocessors☆918Updated 10 months ago
- ☆1,317Updated last month
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆380Updated 3 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,115Updated last year
- Implementation of the Wave-U-Net for audio source separation☆824Updated last year
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆639Updated 2 years ago
- Code for YouTube series: Deep Learning for Audio Classification☆539Updated last year
- Audio processing by using pytorch 1D convolution network☆1,009Updated 7 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,124Updated 3 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆924Updated 2 weeks ago
- The PyTorch-based audio source separation toolkit for researchers☆2,224Updated 2 months ago
- Environmental sound classification using Deep Learning with extracted features☆161Updated 4 years ago
- Deep learning for audio denoising☆644Updated 11 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,483Updated this week
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,419Updated last week
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆644Updated 6 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆422Updated 4 years ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,362Updated 2 years ago
- A flexible source separation library in Python☆604Updated last year
- A library for soundscape synthesis and augmentation☆375Updated 2 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆667Updated last year
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆835Updated 3 years ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆900Updated 5 months ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆809Updated last year
- ☆632Updated 3 months ago
- The Munich Open-Source Large-Scale Multimedia Feature Extractor☆570Updated 11 months ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆489Updated 3 years ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆733Updated 2 months ago