soham97/awesome-sound_event_detection

Reading list for research topics in Sound AI

☆201

Alternatives and similar repositories for awesome-sound_event_detection

Users who are interested in awesome-sound_event_detection are comparing it to the repositories listed below.

Audio-WestlakeU / ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
☆174 · Jun 8, 2026 · Updated last month
jiwonix / Sound-Event-Detection-papers
Sound Event Detection (SED) paper collection
☆15 · Jun 26, 2024 · Updated 2 years ago
soham97 / MTL_Weakly_labelled_audio_data
Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"
☆17 · Nov 9, 2022 · Updated 3 years ago
RetroCirce / HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
☆504 · Sep 18, 2025 · Updated 10 months ago
fgnt / pb_sed
Paderborn Sound Event Detection
☆80 · Jul 18, 2023 · Updated 3 years ago
merlresearch / sebbs
Prediction of sound event bounding boxes (SEBBs)
☆35 · Aug 2, 2024 · Updated last year
robertanto / Real-Time-Sound-Event-Detection
This repository contains the Python implementation of a Sound Event Detection system working in real time.
☆75 · Oct 10, 2022 · Updated 3 years ago
jim-schwoebel / sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆47 · Feb 20, 2022 · Updated 4 years ago
CPJKU / cpjku_dcase24
☆29 · Oct 17, 2024 · Updated last year
DCASE-REPO / DESED_task
Domestic environment sound event detection task
☆157 · Jun 11, 2024 · Updated 2 years ago
sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆23 · Aug 22, 2021 · Updated 4 years ago
Kikyo-16 / Sound_event_detection
This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…
☆129 · Jul 24, 2020 · Updated 6 years ago
frednam93 / FDY-SED
☆96 · Jun 22, 2023 · Updated 3 years ago
j-bernardi / psds_eval
Polyphonic Sound Detection Score (PSDS)
☆20 · Jan 20, 2020 · Updated 6 years ago
turpaultn / DESED
Repo associated with the DESED dataset: download and creation of data
☆155 · Jul 16, 2024 · Updated 2 years ago
microsoft / NoAudioCaptioning
Repository for "Training Audio Captioning Models without Audio"
☆10 · Sep 26, 2023 · Updated 2 years ago
fgnt / sed_scores_eval
☆41 · Feb 18, 2026 · Updated 5 months ago
dr-costas / SEDLM
Language modelling for sound event detection
☆20 · Jan 2, 2020 · Updated 6 years ago
c4dm / dcase-few-shot-bioacoustic
☆61 · Jul 2, 2024 · Updated 2 years ago
fschmid56 / PretrainedSED
☆145 · May 13, 2025 · Updated last year
qiuqiangkong / sound_event_detection_dcase2017_task4
☆55 · Jun 3, 2020 · Updated 6 years ago
Anaesthesiaye / sound_event_detection_transformer
Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
☆46 · May 9, 2022 · Updated 4 years ago
kkoutini / PaSST
Efficient Training of Audio Transformers with Patchout
☆386 · Jan 12, 2024 · Updated 2 years ago
fschmid56 / EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353 · Nov 20, 2024 · Updated last year
Audio-WestlakeU / audiossl
A library built for easier audio self-supervised training and downstream task evaluation
☆140 · Sep 25, 2025 · Updated 10 months ago
Ming-er / LGC-SED
☆13 · Jan 3, 2024 · Updated 2 years ago
cai525 / Transformer4SED
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆104 · Feb 10, 2026 · Updated 5 months ago
qiuqiangkong / audioset_tagging_cnn
☆1,766 · Jul 25, 2024 · Updated 2 years ago
Jungjee / DcaseNet
Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…
☆45 · Nov 10, 2021 · Updated 4 years ago
YuanGongND / psla
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150 · Jul 13, 2023 · Updated 3 years ago
TUT-ARG / sed_vis
Visualization toolbox for Sound Event Detection
☆122 · Feb 26, 2024 · Updated 2 years ago
soham97 / sound_ai_progress
Tracking the state of the art and recent results (bibliography) on sound tasks.
☆33 · Jan 10, 2023 · Updated 3 years ago
TUT-ARG / sed_eval
Evaluation toolbox for Sound Event Detection
☆161 · Jun 12, 2024 · Updated 2 years ago
sharathadavanne / seld-net
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…
☆405 · Nov 21, 2022 · Updated 3 years ago
wsntxxn / TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49 · Oct 23, 2025 · Updated 9 months ago
dr-costas / dnd-sed
Sound event detection with depthwise separable and dilated convolutions.
☆53 · Mar 30, 2020 · Updated 6 years ago
zeyuxie29 / AudioTime
☆39 · Jul 4, 2024 · Updated 2 years ago
sharathadavanne / sed-crnn
Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…
☆202 · Jun 21, 2022 · Updated 4 years ago
YuanGongND / ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆428 · Aug 14, 2022 · Updated 3 years ago