axeber01/ngcc-seld

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/axeber01/ngcc-seld)

axeber01 / ngcc-seld

Sound Event Localization and Detection using Neural Generalized Cross-Correlations

☆36

Alternatives and similar repositories for ngcc-seld

Users that are interested in ngcc-seld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
Jinbo-Hu / DCASE2022-TASK3
View on GitHub
☆37Nov 14, 2024Updated last year
partha2409 / DCASE2024_seld_baseline
View on GitHub
☆52Dec 13, 2025Updated 7 months ago
danielkrause / DCASE2022-data-generator
View on GitHub
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47Apr 5, 2023Updated 3 years ago
dberghi / AV-SELD
View on GitHub
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
☆31Apr 26, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
axeber01 / wav2pos
View on GitHub
3D Sound Source Localization using Masked Autoencoders
☆21Feb 12, 2025Updated last year
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆27May 27, 2025Updated last year
yusunnny / CST-former
View on GitHub
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39May 20, 2025Updated last year
SonyResearch / dcase2025_stereo_seld_data_generator
View on GitHub
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17Jul 17, 2025Updated last year
muuda / MFF-EINV2
View on GitHub
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆22Jul 17, 2024Updated 2 years ago
sakshamsingh1 / sound_distance_estimation
View on GitHub
Official implementation of "sound distance estimation" WASPAA 23
☆20Dec 31, 2023Updated 2 years ago
axeber01 / ngcc
View on GitHub
Neural Generalized Cross Correlations https://arxiv.org/abs/2208.04654
☆37Feb 11, 2025Updated last year
michaelneri / audio-distance-estimation
View on GitHub
Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …
☆40Jun 29, 2026Updated 3 weeks ago
Jinbo-Hu / SELD-Data-Generator
View on GitHub
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonic…
☆22Nov 13, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
aromanusc / SoundQ
View on GitHub
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
☆14Mar 21, 2025Updated last year
thomeou / SALSA
View on GitHub
This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.
☆114May 31, 2022Updated 4 years ago
sadPororo / AD-YOLO
View on GitHub
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, IEEE ICASSP 2023
☆35Dec 21, 2025Updated 7 months ago
juliawilkins / ambisonics2binaural_simple
View on GitHub
A simple Python script to convert FOA audio to binaural.
☆17Nov 29, 2022Updated 3 years ago
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
egrinstein / neural_srp
View on GitHub
The Neural-SRP method for DOA estimation
☆37May 24, 2024Updated 2 years ago
Hong-Hengyi / MVANet-SELD
View on GitHub
For more detailed information, please refer to the paper titled "MVANet: Multi-Stage Video Attention Network for Sound Event Localization…
☆35May 20, 2025Updated last year
vlsi-nanocomputing / dynamic-sound
View on GitHub
DynamicSound Simulator is a modular Python library for generating virtual acoustic scenes with configurable microphones, sound sources, a…
☆18Jul 15, 2026Updated last week
yxdong0320 / Solution_on_3D_SELD
View on GitHub
The program ranked first in Audio-only track of DCASE2024 Challenge task3.
☆22Mar 2, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
thomeou / SALSA-Lite
View on GitHub
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15Dec 3, 2021Updated 4 years ago
rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
FYJNEVERFOLLOWS / Awesome-Sound-Source-Localization
View on GitHub
A tutorial for Sound Source Localization researchers and practitioners. The purpose of this repo is to organize the world’s resources for…
☆59Mar 17, 2023Updated 3 years ago
egrinstein / gnn_ssl
View on GitHub
Graph Neural Networks for Sound Source Localization
☆29Oct 31, 2023Updated 2 years ago
DavidDiazGuerra / icoCNN
View on GitHub
Pytorch implementation of the icosahedral CNNs
☆21Apr 24, 2023Updated 3 years ago
HauLiang / DAMAS-FISTA-Net
View on GitHub
Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming
☆21Aug 20, 2024Updated last year
sharathadavanne / seld-dcase2023
View on GitHub
Baseline method for sound event localization task of DCASE 2023 challenge
☆71Mar 13, 2023Updated 3 years ago
Devin-Pi / uncertainty-estimation-for-ssl
View on GitHub
This repo is for the paper "Uncertainty Estimation for Sound Source Localization".
☆15Mar 13, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zszheng147 / Spatial-AST
View on GitHub
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
☆87Feb 13, 2025Updated last year
sholokhovalexey / active-noise-control
View on GitHub
Active noise controller (ANC) design: a practical primer
☆15Jan 8, 2026Updated 6 months ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
View on GitHub
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆126Jan 8, 2023Updated 3 years ago
Soumitro-Chakrabarty / Single-speaker-localization
View on GitHub
CNN based single speaker localization
☆50Aug 28, 2020Updated 5 years ago
viduzz84 / SubbandAdaptiveX
View on GitHub
Subband Adaptive System with Crossterms for aliasing reduction
☆18Jul 31, 2022Updated 3 years ago
adrianSRoman / DeepWaveDOA
View on GitHub
ICASSP 2024: Robust DOA estimation from deep acoustic imaging
☆25Apr 14, 2024Updated 2 years ago
sharathadavanne / multiple-target-tracking
View on GitHub
Tracking unknown number of 2D targets/sources
☆62Nov 20, 2020Updated 5 years ago