partha2409/DCASE2024_seld_baseline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/partha2409/DCASE2024_seld_baseline)

partha2409 / DCASE2024_seld_baseline

☆52

Alternatives and similar repositories for DCASE2024_seld_baseline

Users that are interested in DCASE2024_seld_baseline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
sharathadavanne / seld-dcase2023
View on GitHub
Baseline method for sound event localization task of DCASE 2023 challenge
☆70Mar 13, 2023Updated 3 years ago
yxdong0320 / Solution_on_3D_SELD
View on GitHub
The program ranked first in Audio-only track of DCASE2024 Challenge task3.
☆22Mar 2, 2026Updated 4 months ago
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆27May 27, 2025Updated last year
Jinbo-Hu / DCASE2022-TASK3
View on GitHub
☆37Nov 14, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
danielkrause / DCASE2022-data-generator
View on GitHub
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47Apr 5, 2023Updated 3 years ago
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
dberghi / AV-SELD
View on GitHub
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
☆31Apr 26, 2024Updated 2 years ago
Jinbo-Hu / PSELDNets
View on GitHub
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆46Sep 17, 2025Updated 10 months ago
yusunnny / CST-former
View on GitHub
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39May 20, 2025Updated last year
sadPororo / AD-YOLO
View on GitHub
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, IEEE ICASSP 2023
☆35Dec 21, 2025Updated 6 months ago
thomeou / SALSA-Lite
View on GitHub
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15Dec 3, 2021Updated 4 years ago
Jinbo-Hu / L3DAS22-TASK2
View on GitHub
A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection
☆23Nov 14, 2024Updated last year
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆35Feb 11, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Hong-Hengyi / MVANet-SELD
View on GitHub
For more detailed information, please refer to the paper titled "MVANet: Multi-Stage Video Attention Network for Sound Event Localization…
☆35May 20, 2025Updated last year
sakshamsingh1 / sound_distance_estimation
View on GitHub
Official implementation of "sound distance estimation" WASPAA 23
☆20Dec 31, 2023Updated 2 years ago
Orlllem / seld_wav2vec2
View on GitHub
☆18Feb 1, 2026Updated 5 months ago
aromanusc / SoundQ
View on GitHub
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
☆14Mar 21, 2025Updated last year
yinkalario / EIN-SELD
View on GitHub
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
☆79Aug 5, 2021Updated 4 years ago
chrschy / pilot
View on GitHub
☆19Jun 10, 2021Updated 5 years ago
apple / ml-spatial-librispeech
View on GitHub
A large synthetic dataset of spatial audio with multiple labels
☆126Oct 25, 2023Updated 2 years ago
adrianSRoman / DeepWaveDOA
View on GitHub
ICASSP 2024: Robust DOA estimation from deep acoustic imaging
☆24Apr 14, 2024Updated 2 years ago
thomeou / SALSA
View on GitHub
This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.
☆114May 31, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sharathadavanne / seld-net
View on GitHub
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…
☆402Nov 21, 2022Updated 3 years ago
yinkalario / DCASE2019-TASK3
View on GitHub
Our DCASE 2019 challenge task 3 method
☆32Jan 17, 2023Updated 3 years ago
michaelneri / audio-distance-estimation
View on GitHub
Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …
☆40Jun 29, 2026Updated 2 weeks ago
MRSAudio / MRSAudio_Main
View on GitHub
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43Oct 15, 2025Updated 9 months ago
FYJNEVERFOLLOWS / ResNet-STFT-SSL
View on GitHub
ResNet-STFT Model for Sound Source Localization
☆20Aug 25, 2022Updated 3 years ago
nttrd-mdlab / wearable-seld-dataset
View on GitHub
☆10Feb 18, 2022Updated 4 years ago
SonyResearch / dcase2025_stereo_seld_data_generator
View on GitHub
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17Jul 17, 2025Updated last year
danielkrause / Moving-Binaural-SDEL
View on GitHub
Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"
☆22Mar 2, 2025Updated last year
afrancl / BinauralLocalizationCNN
View on GitHub
Code to create networks that localize sounds sources in 3D environments
☆53Jan 27, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zszheng147 / Spatial-AST
View on GitHub
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
☆87Feb 13, 2025Updated last year
thomeou / General-network-architecture-for-sound-event-localization-and-detection
View on GitHub
This repository consists of python code to train sound event localization and detection models.
☆22Jan 21, 2021Updated 5 years ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
View on GitHub
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆126Jan 8, 2023Updated 3 years ago
DCASE2024-Task7-Sound-Scene-Synthesis / AudioLDM-training-finetuning
View on GitHub
AudioLDM training, finetuning, evaluation and inference.
☆13Mar 27, 2024Updated 2 years ago
rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
sharathadavanne / seld-dcase2022
View on GitHub
Baseline method for sound event localization task of DCASE 2022 challenge
☆64Jun 21, 2022Updated 4 years ago
CPJKU / dcase2024_task1_baseline
View on GitHub
☆10Jun 6, 2024Updated 2 years ago