soham97 / awesome-sound_event_detection
Reading list for research topics in Sound AI
☆196 · Aug 8, 2024 · Updated last year
Alternatives and similar repositories for awesome-sound_event_detection
Users who are interested in awesome-sound_event_detection are comparing it to the libraries listed below
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆158 · Aug 24, 2025 · Updated 5 months ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection" ☆470 · Sep 18, 2025 · Updated 4 months ago
- Sound Event Detection (SED) paper collection ☆17 · Jun 26, 2024 · Updated last year
- Polyphonic Sound Detection Score (PSDS) ☆15 · Jan 20, 2020 · Updated 6 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection" ☆17 · Nov 9, 2022 · Updated 3 years ago
- Paderborn Sound Event Detection ☆78 · Jul 18, 2023 · Updated 2 years ago
- ☆27 · Oct 17, 2024 · Updated last year
- Prediction of sound event bounding boxes (SEBBs) ☆32 · Aug 2, 2024 · Updated last year
- Domestic environment sound event detection task ☆155 · Jun 11, 2024 · Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆45 · May 9, 2022 · Updated 3 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning. ☆46 · Feb 20, 2022 · Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data ☆144 · Jul 16, 2024 · Updated last year
- ☆113 · May 13, 2025 · Updated 9 months ago
- Easy to use Audio Tagging in PyTorch ☆23 · Aug 22, 2021 · Updated 4 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training … ☆328 · Nov 20, 2024 · Updated last year
- ☆60 · Jul 2, 2024 · Updated last year
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp… ☆131 · Jul 24, 2020 · Updated 5 years ago
- ☆95 · Jun 22, 2023 · Updated 2 years ago
- Repository for "Training Audio Captioning Models without Audio" ☆10 · Sep 26, 2023 · Updated 2 years ago
- ☆54 · Jun 3, 2020 · Updated 5 years ago
- Visualization toolbox for Sound Event Detection ☆124 · Feb 26, 2024 · Updated last year
- ☆37 · Jul 4, 2024 · Updated last year
- ☆39 · Jan 19, 2026 · Updated 3 weeks ago
- Efficient Training of Audio Transformers with Patchout ☆371 · Jan 12, 2024 · Updated 2 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks. ☆32 · Jan 10, 2023 · Updated 3 years ago
- ☆1,662 · Jul 25, 2024 · Updated last year
- Evaluation toolbox for Sound Event Detection ☆157 · Jun 12, 2024 · Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG) ☆50 · Oct 23, 2025 · Updated 3 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆149 · Jul 13, 2023 · Updated 2 years ago
- ☆13 · Jan 3, 2024 · Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆43 · Nov 10, 2021 · Updated 4 years ago
- ☆28 · Mar 14, 2023 · Updated 2 years ago
- Audio Captioning datasets for PyTorch. ☆126 · Jul 18, 2025 · Updated 6 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,419 · May 21, 2023 · Updated 2 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'. ☆42 · Jul 23, 2023 · Updated 2 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w… ☆198 · Jun 21, 2022 · Updated 3 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆74 · Aug 24, 2024 · Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 · Aug 1, 2025 · Updated 6 months ago
- ☆16 · Jun 12, 2025 · Updated 8 months ago