cai525 / Transformer4SEDView external linksLinks
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆88Nov 4, 2025Updated 3 months ago
Alternatives and similar repositories for Transformer4SED
Users that are interested in Transformer4SED are comparing it to the libraries listed below
Sorting:
- ☆19Mar 6, 2025Updated 11 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆17Nov 19, 2024Updated last year
- ☆22Mar 19, 2025Updated 10 months ago
- ☆27Oct 17, 2024Updated last year
- Domestic environment sound event detection task☆155Jun 11, 2024Updated last year
- ☆113May 13, 2025Updated 9 months ago
- ☆18Aug 16, 2025Updated 5 months ago
- Sound Event Detection (SED) paper collection☆17Jun 26, 2024Updated last year
- ☆67Sep 13, 2024Updated last year
- ☆11Dec 28, 2023Updated 2 years ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Repo associated to the DESED dataset, download and creation of data☆144Jul 16, 2024Updated last year
- ☆13Jan 3, 2024Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆136Sep 25, 2025Updated 4 months ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45May 9, 2022Updated 3 years ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Feb 6, 2026Updated last week
- Here is a repository stored the classical sound source localization algorithms in spherical domain, namely, PWD, DAS, SHMUSIC, SHMVDR, S…☆21Nov 16, 2023Updated 2 years ago
- ☆95Jun 22, 2023Updated 2 years ago
- MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection☆21Jul 17, 2024Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆28Oct 25, 2024Updated last year
- ClickAttention: Click Region Similarity Guided Interactive Segmentation☆23Jan 3, 2025Updated last year
- ☆20Apr 11, 2019Updated 6 years ago
- ☆49Apr 4, 2025Updated 10 months ago
- Visualization toolbox for Sound Event Detection☆124Feb 26, 2024Updated last year
- ☆60Jul 2, 2024Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆56Jun 12, 2025Updated 8 months ago
- ☆11Sep 25, 2024Updated last year
- System that ranked 2nd in DCASE 2023 Challenge Task 5: Few-shot Bioacoustic Event Detection☆12Sep 5, 2024Updated last year
- ☆10Oct 16, 2025Updated 3 months ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆31Oct 15, 2025Updated 3 months ago
- CVPR 2025 Workshop on CVEU.☆42Jun 12, 2025Updated 8 months ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- ☆28Mar 14, 2023Updated 2 years ago
- [ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of LSTM, RezoNet and Hybrid CNNs-BiLSTM Architecture" by Nhut Mi…☆10Jan 16, 2025Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework