qiuqiangkong / panns_transfer_to_gtzan
☆103Updated 4 years ago
Alternatives and similar repositories for panns_transfer_to_gtzan:
Users that are interested in panns_transfer_to_gtzan are comparing it to the libraries listed below
- Domestic environment sound event detection task☆138Updated 7 months ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆126Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆132Updated 6 months ago
- ☆53Updated 4 years ago
- Code for DCASE 2020 task 1a and task 1b.☆86Updated 3 years ago
- ☆207Updated 10 months ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆68Updated 3 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆100Updated last year
- Paderborn Sound Event Detection☆72Updated last year
- Baseline of dcase 2019 task 4☆58Updated 2 years ago
- General purpose sound recognition demo☆152Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆38Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆127Updated 2 years ago
- Baseline of DCASE 2020 task 4☆43Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆115Updated 3 months ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆53Updated 4 years ago
- Visualization toolbox for Sound Event Detection☆118Updated 11 months ago
- ☆81Updated last year
- ☆63Updated 4 months ago
- CP-JKU submission to DCASE 19, performant single-model CNN☆56Updated 4 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆142Updated last year
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆110Updated 2 years ago
- CP-JKU submission to DCASE 20☆43Updated 3 years ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆164Updated last year
- Official repository of our paper: https://arxiv.org/abs/2010.15366☆61Updated 3 years ago
- Pytorch port of Google Research's LEAF Audio paper☆92Updated 3 years ago
- ☆41Updated 5 months ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆43Updated 4 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144Updated 2 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆72Updated 4 years ago