Audio-WestlakeU / ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
☆123 Updated 5 months ago
Alternatives and similar repositories for ATST-SED:
Users who are interested in ATST-SED are comparing it to the repositories listed below:
- A library built for easier audio self-supervised training, downstream tasks evaluation ☆114 Updated 6 months ago
- Domestic environment sound event detection task ☆142 Updated 9 months ago
- ☆83 Updated last year
- ☆62 Updated 6 months ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆41 Updated 2 years ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 2 years ago
- ☆35 Updated last month
- Source code for Consistent ensemble distillation for audio tagging ☆27 Updated 8 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge ☆34 Updated this week
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆183 Updated 3 months ago
- Sound Event Detection (SED) paper collection ☆13 Updated 8 months ago
- Baseline method for sound event localization task of DCASE 2023 challenge ☆47 Updated 2 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification" ☆26 Updated last year
- ☆100 Updated last year
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python. ☆57 Updated 4 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022) ☆58 Updated 10 months ago
- Repo associated to the DESED dataset, download and creation of data ☆137 Updated 8 months ago
- ☆18 Updated 2 weeks ago
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer ☆133 Updated 3 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 Updated last year
- Paderborn Sound Event Detection ☆73 Updated last year
- ☆189 Updated last year
- ☆50 Updated last year
- ☆30 Updated 8 months ago
- Official data preparation scripts for the URGENT 2024 Challenge ☆76 Updated 2 months ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge ☆51 Updated this week
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement" ☆118 Updated 2 years ago
- ☆47 Updated 2 years ago
- ☆34 Updated 9 months ago
- ☆32 Updated 4 months ago