yxdong0320/Solution_on_3D_SELD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yxdong0320/Solution_on_3D_SELD)

yxdong0320 / Solution_on_3D_SELD

The program ranked first in Audio-only track of DCASE2024 Challenge task3.

☆23

Alternatives and similar repositories for Solution_on_3D_SELD

Users that are interested in Solution_on_3D_SELD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SonyResearch / dcase2025_stereo_seld_data_generator
View on GitHub
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17Jul 17, 2025Updated last year
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆28May 27, 2025Updated last year
sakshamsingh1 / sound_distance_estimation
View on GitHub
Official implementation of "sound distance estimation" WASPAA 23
☆20Dec 31, 2023Updated 2 years ago
Jinbo-Hu / PSELDNets
View on GitHub
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆47Sep 17, 2025Updated 10 months ago
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
danielkrause / DCASE2022-data-generator
View on GitHub
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47Apr 5, 2023Updated 3 years ago
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
Hong-Hengyi / MVANet-SELD
View on GitHub
For more detailed information, please refer to the paper titled "MVANet: Multi-Stage Video Attention Network for Sound Event Localization…
☆35May 20, 2025Updated last year
partha2409 / DCASE2024_seld_baseline
View on GitHub
☆52Dec 13, 2025Updated 7 months ago
yusunnny / CST-former
View on GitHub
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39May 20, 2025Updated last year
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆36Feb 11, 2025Updated last year
aromanusc / SoundQ
View on GitHub
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
☆14Mar 21, 2025Updated last year
JHU-LCAP / FlexSED
View on GitHub
open-vocabulary sound event detection
☆53Dec 17, 2025Updated 7 months ago
muuda / MFF-EINV2
View on GitHub
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆23Jul 17, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fschmid56 / PretrainedSED
View on GitHub
☆145May 13, 2025Updated last year
yuhanghe01 / Sound3DVDet
View on GitHub
Code for WACV24 work for multiview acoustic-visual detection
☆13Mar 22, 2024Updated 2 years ago
MRSAudio / MRSAudio_Main
View on GitHub
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43Oct 15, 2025Updated 9 months ago
dberghi / AV-SELD
View on GitHub
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
☆31Apr 26, 2024Updated 2 years ago
BrownsugarZeer / Multi_SSL
View on GitHub
Combine sound source separation with SRP-PHAT to achieve multi-source localization.
☆97Jan 22, 2025Updated last year
introlab / uimvdr
View on GitHub
☆13Oct 11, 2024Updated last year
donghoney0416 / DeFT-AN
View on GitHub
Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…
☆38Jan 26, 2026Updated 6 months ago
Exgc / OmniSep
View on GitHub
Sound Separation, Omni modal
☆29Sep 15, 2025Updated 10 months ago
linjac / GenDARA
View on GitHub
☆13Jan 14, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
theMoro / EfficientSED
View on GitHub
☆22Jun 12, 2025Updated last year
fschmid56 / cpjku_dcase23
View on GitHub
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32Sep 18, 2023Updated 2 years ago
sharathadavanne / seld-dcase2023
View on GitHub
Baseline method for sound event localization task of DCASE 2023 challenge
☆71Mar 13, 2023Updated 3 years ago
iclr2024mcmi / ICLRMCMI
View on GitHub
Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information
☆12Sep 28, 2023Updated 2 years ago
FYJNEVERFOLLOWS / Paper-Reading-Notes
View on GitHub
Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverbera…
☆30Jun 8, 2023Updated 3 years ago
BUTSpeechFIT / SOT-DiCoW
View on GitHub
Multi-talker ASR based on DiCoW with Serialized Output Training
☆21Sep 18, 2025Updated 10 months ago
cai525 / Transformer4SED
View on GitHub
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆104Feb 10, 2026Updated 5 months ago
philgzl / brever
View on GitHub
Speech enhancement in noisy and reverberant environments using deep neural networks
☆23Oct 10, 2025Updated 9 months ago
Robiwan245 / SiamMAE
View on GitHub
☆12Mar 5, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tangzhengxu2001 / m4oe
View on GitHub
☆16Apr 3, 2025Updated last year
JinXins / SUMix
View on GitHub
About Official PyTorch(MMCV) implementation of “SUMix: Mixup with Semantic and Uncertain Information” (ECCV 2024)
☆12Sep 2, 2024Updated last year
swagshaw / WildDESED
View on GitHub
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18Nov 19, 2024Updated last year
jimbozhang / xares-llm-template
View on GitHub
Template for creating audio encoders compatible with X-ARES
☆19Feb 11, 2026Updated 5 months ago
PoKoHA / Speech_Enhancement-DCCRN
View on GitHub
DCCRN: Deep Complex Convolution Recurrent Network
☆14Nov 26, 2021Updated 4 years ago
seorim0 / SE-using-SRL-Model
View on GitHub
Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings
☆21Jun 6, 2025Updated last year
Alizeded / BERP
View on GitHub
[TASLP 2025] The pytorch implementation of BERP: A Blind Estimator of Room Parameters
☆21Aug 16, 2025Updated 11 months ago