sadPororo/AD-YOLO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sadPororo/AD-YOLO)

sadPororo / AD-YOLO

AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, IEEE ICASSP 2023

☆35

Alternatives and similar repositories for AD-YOLO

Users that are interested in AD-YOLO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yusunnny / CST-former
View on GitHub
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39May 20, 2025Updated last year
muuda / MFF-EINV2
View on GitHub
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆23Jul 17, 2024Updated 2 years ago
partha2409 / DCASE2024_seld_baseline
View on GitHub
☆52Dec 13, 2025Updated 7 months ago
danielkrause / DCASE2022-data-generator
View on GitHub
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47Apr 5, 2023Updated 3 years ago
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
winddori2002 / MANNER
View on GitHub
MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)
☆65Aug 29, 2022Updated 3 years ago
chrschy / pilot
View on GitHub
☆19Jun 10, 2021Updated 5 years ago
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
soonhyeon / Noisy-ArcMix
View on GitHub
Noisy-ArcMix: Additive Noisy Angular Margin Loss Combined With Mixup for Anomalous Sound Detection
☆31Aug 22, 2024Updated last year
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆36Feb 11, 2025Updated last year
dr-costas / SEDLM
View on GitHub
Language modelling for sound event detection
☆20Jan 2, 2020Updated 6 years ago
yinkalario / EIN-SELD
View on GitHub
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
☆79Aug 5, 2021Updated 4 years ago
sharathadavanne / seld-dcase2022
View on GitHub
Baseline method for sound event localization task of DCASE 2022 challenge
☆64Jun 21, 2022Updated 4 years ago
kinggongzilla / DCASE2023_Task2
View on GitHub
☆23May 15, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sharathadavanne / seld-dcase2021
View on GitHub
Baseline method for sound event localization task of DCASE 2021 challenge
☆45Jun 15, 2021Updated 5 years ago
yihongXU / DAUMOT
View on GitHub
Official Implementation for DAUMOT: Domain Adaptation for Unsupervised Multiple Object Tracking, An unsupervised MOT training framework w…
☆12Mar 14, 2022Updated 4 years ago
gyx-gloria / DMT
View on GitHub
Official Implementation of DMT: Dual Mean-Teacher in PyTorch.
☆10Oct 27, 2023Updated 2 years ago
sharathadavanne / seld-dcase2023
View on GitHub
Baseline method for sound event localization task of DCASE 2023 challenge
☆71Mar 13, 2023Updated 3 years ago
AppliedAcousticsChalmers / ambisonic-encoding
View on GitHub
Example MATLAB/Octave scripts to perform ambisonic encoding of microphone array signals
☆44May 23, 2026Updated 2 months ago
wilkinghoff / ssl4asd
View on GitHub
Code for the paper "Self-Supervised Learning for Anomalous Sound Detection"
☆43May 13, 2024Updated 2 years ago
zszheng147 / Spatial-AST
View on GitHub
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
☆87Feb 13, 2025Updated last year
dberghi / AV-SELD
View on GitHub
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
☆31Apr 26, 2024Updated 2 years ago
winddori2002 / DEX-TTS
View on GitHub
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆108Jan 17, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
PeiwenSun2000 / Both-Ears-Wide-Open
View on GitHub
The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
☆65Jul 2, 2025Updated last year
JaniceWuo / PoetryQA
View on GitHub
结合知识图谱做的有关诗词的问答demo
☆11Mar 11, 2020Updated 6 years ago
frednam93 / FDY-SED
View on GitHub
☆96Jun 22, 2023Updated 3 years ago
liuyoude / STgram-MFN
View on GitHub
A spectro-temporal fusion feature, STgram, with MobileFaceNet For more stable Anomalous Sound Detection
☆107Mar 28, 2023Updated 3 years ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
View on GitHub
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆126Jan 8, 2023Updated 3 years ago
taishi-n / torchrir
View on GitHub
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes, GPU acceleration.
☆23Updated this week
Zumbalamambo / multi-track_particle_filtering
View on GitHub
Tracking multiple objects with systematic re-sampling particle filtering
☆14Jun 26, 2017Updated 9 years ago
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
zhaoyanpeng / audioset-dl
View on GitHub
Download AudioSet for Vision-Audio-Text Pre-training
☆13May 16, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
XiangzhuKong / CA-Dense-UNet
View on GitHub
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13Jul 17, 2023Updated 3 years ago
thomeou / General-network-architecture-for-sound-event-localization-and-detection
View on GitHub
This repository consists of python code to train sound event localization and detection models.
☆22Jan 21, 2021Updated 5 years ago
Whale-Yu / garbage-sorting-pytorch
View on GitHub
垃圾分类pytorch
☆16Apr 13, 2023Updated 3 years ago
nilseuropa / hopenet_ncnn
View on GitHub
Hopenet: deep head pose estimator on ncnn
☆10Jun 18, 2020Updated 6 years ago
mtanveer1 / AVSEC-3-Challenge
View on GitHub
Audio-Visual Speech Enhancement Challenge (AVSE) 2024
☆12Feb 6, 2026Updated 5 months ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago