unixpickle / audiosetLinks
Fetch and use Google's AudioSet dataset
☆126Updated 8 years ago
Alternatives and similar repositories for audioset
Users that are interested in audioset are comparing it to the libraries listed below
Sorting:
- DCASE 2017 Baseline system☆82Updated 4 years ago
- Evaluation toolbox for Sound Event Detection☆147Updated 11 months ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- ☆130Updated 6 years ago
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…☆211Updated last year
- Repo associated to the DESED dataset, download and creation of data☆139Updated 10 months ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆130Updated 2 months ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge☆76Updated 4 years ago
- Deep Attractor Network (DANet) for single-channel speech separation☆76Updated 6 years ago
- Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282☆95Updated 7 years ago
- A pytorch implementation of xvector embedding☆79Updated 5 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Visualization toolbox for Sound Event Detection☆120Updated last year
- DCASE2019 Challenge Task 1 baseline system☆20Updated 5 years ago
- RASTA-PLP and MFCC tool based rasta-mat☆33Updated 2 years ago
- DCASE 2016 Baseline system, python implementation☆51Updated 7 years ago
- Baseline of dcase 2019 task 4☆59Updated 2 years ago
- Speech separation with utterance-level PIT experiments☆104Updated 6 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- An open-source speech separation and enhancement library☆211Updated 5 years ago
- Learn and L3 embedding from audio/video pairs☆87Updated 3 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆79Updated 7 years ago
- ☆226Updated 5 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆127Updated 4 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆42Updated 6 years ago
- A tensorflow implementation for Deep clustering: Discriminative embeddings for segmentation and separation☆135Updated 7 years ago
- Benchmark popular audio i/o packages☆141Updated last year
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago