jim-schwoebel / download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
☆103, updated last year
Alternatives and similar repositories for download_audioset
Users who are interested in download_audioset compare it to the repositories listed below.
- Repo associated to the DESED dataset, download and creation of data ☆139, updated 10 months ago
- Paderborn Sound Event Detection ☆74, updated last year
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection ☆72, updated 3 years ago
- Domestic environment sound event detection task ☆144, updated 11 months ago
- ☆53, updated 5 years ago
- Baseline systems for the FSD50K dataset ☆69, updated 3 years ago
- A list of papers about audio captioning ☆77, updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆41, updated 3 years ago
- Baseline of DCASE 2020 task 4 ☆43, updated 2 years ago
- ☆107, updated 4 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp… ☆129, updated 4 years ago
- Visualization toolbox for Sound Event Detection ☆120, updated last year
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples ☆42, updated 6 years ago
- Toolkit for downloading and processing Google's AudioSet dataset. ☆170, updated last year
- ☆65, updated 8 months ago
- Baseline of DCASE 2019 task 4 ☆58, updated 2 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge ☆55, updated 4 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation ☆118, updated 9 months ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection" ☆23, updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi… ☆90, updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information" ☆45, updated 5 years ago
- An Audio-Visual Voice Activity Detection using Deep Learning ☆49, updated 6 years ago
- PyTorch: Channel-wise subband (CWS) input for better voice and accompaniment separation ☆99, updated 3 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆131, updated this week
- Augmentation adversarial training for self-supervised speaker recognition ☆77, updated 3 years ago
- CP-JKU submission to DCASE 19, performant single-model CNN ☆57, updated 4 years ago
- ☆46, updated 9 months ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper ☆142, updated last year
- CP-JKU submission to DCASE 20 ☆44, updated 4 years ago
- PyTorch implementation of Generalized End-to-End Loss for speaker verification ☆84, updated 6 years ago