Reading list for research topics in Sound AI
☆196Aug 8, 2024Updated last year
Alternatives and similar repositories for awesome-sound_event_detection
Users that are interested in awesome-sound_event_detection are comparing it to the libraries listed below
Sorting:
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆159Aug 24, 2025Updated 6 months ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆474Sep 18, 2025Updated 5 months ago
- Sound Event Detection (SED) paper collection☆17Jun 26, 2024Updated last year
- Polyphonic Sound Detection Score (PSDS)☆15Jan 20, 2020Updated 6 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆17Nov 9, 2022Updated 3 years ago
- Paderborn Sound Event Detection☆78Jul 18, 2023Updated 2 years ago
- ☆28Oct 17, 2024Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Domestic environment sound event detection task☆155Jun 11, 2024Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45May 9, 2022Updated 3 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆46Feb 20, 2022Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆144Jul 16, 2024Updated last year
- ☆114May 13, 2025Updated 9 months ago
- Easy to use Audio Tagging in PyTorch☆23Aug 22, 2021Updated 4 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆331Nov 20, 2024Updated last year
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆131Jul 24, 2020Updated 5 years ago
- ☆60Jul 2, 2024Updated last year
- ☆95Jun 22, 2023Updated 2 years ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- Visualization toolbox for Sound Event Detection☆123Feb 26, 2024Updated 2 years ago
- ☆37Jul 4, 2024Updated last year
- ☆40Feb 18, 2026Updated 2 weeks ago
- Efficient Training of Audio Transformers with Patchout☆370Jan 12, 2024Updated 2 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Jan 10, 2023Updated 3 years ago
- ☆1,669Jul 25, 2024Updated last year
- Evaluation toolbox for Sound Event Detection☆158Jun 12, 2024Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 4 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- ☆13Jan 3, 2024Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆43Nov 10, 2021Updated 4 years ago
- ☆28Mar 14, 2023Updated 2 years ago
- Audio Captioning datasets for PyTorch.☆127Jul 18, 2025Updated 7 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,426May 21, 2023Updated 2 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆199Jun 21, 2022Updated 3 years ago
- ☆16Jun 12, 2025Updated 8 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year