Anaesthesiaye / sound_event_detection_transformer
code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
☆35Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for sound_event_detection_transformer
- ☆62Updated last month
- ☆78Updated last year
- ☆31Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆97Updated 3 weeks ago
- ☆23Updated last year
- ☆28Updated 4 months ago
- ☆29Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆14Updated 3 months ago
- ☆18Updated 2 years ago
- ☆53Updated 4 years ago
- ☆45Updated last year
- Paderborn Sound Event Detection☆69Updated last year
- Domestic environment sound event detection task☆127Updated 4 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆28Updated 3 weeks ago
- Learning differentiable temporal resolution on time-series data.☆32Updated last year
- ☆27Updated 4 months ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆39Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆26Updated 2 years ago
- ☆48Updated 2 years ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆63Updated 2 years ago
- ☆26Updated 10 months ago
- implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch☆49Updated 3 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆105Updated 2 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆44Updated 2 years ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆29Updated last year
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆95Updated 2 years ago
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆42Updated 2 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆22Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆59Updated 2 years ago
- ☆26Updated last year